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FfELD OF THE INVENTION 

The present invention relates to an isolated polypeptidB having protease activity and 
6 being homologous to NoGardiopsis proteases, as we!i as isolated nucleic acid sequences 
encoding it. The invention furthermore relates to nucleic acid constructs, vectors, and host 
cells, including transgenic plants and non-human animais, comprising these nucleic acid 
sequences, as well as methods for producing and using the protease, in particular vi?ithin 
animal feed. 

10 

BACKGROUND OF THE INVENTION 

Proteases derived from Nocardiopsis sp, NRRL 18262 and Nocardiopsis dassonvillei 
NRFU- 18133 are disdosed in WO 88/03947. The DMA and amino acid sequences of the 
protease derived from Nocardiopsis sp. NRRL 18262 are shown in DK application no. 1996 
15 00013, WO 01/58276 discloses the use in anima! feed of acid-stable proteases related to the 
protease derived from Nocardiopsis sp. NRRL 18262, as wetl as a protease derived from 
NocaiWopst's alba DSM 14010. 

JP 2-255081 ~A discloses a protease derived from Nocardiopsis sp. strain OPC-210 
{PERM P-10508), how^ever without sequence infonnation. The strain is no longer available, 
20 as the deposit was withdrawn, 

DD 20043218 discloses a proteolytic preparation derived from Nocardiopsfs 
cfassonw/te/ strain ZfMET 43647, however without sequence information. The strain appears 
to be no longer avallabie. 

JP 2003284571-A, published after the first filing date of the present invention, 
25 discloses the amino acid sequence and the corresponding DNA sequence of a protease 
derived from Nocardiopsis sp. TOA-1 (PERM P-18676). The sequence has been entered in 
GEl^ESEQP with no. ADF43564. 

(t is an object of the present Invention to provide alternative proteases, in particular 
with a potentiai for use in animal feed and/or determents. 

30 

SUMMARY OF THE INVENTION 

The present inventors isolated and characterized a protease derived from 
Nocardiopsis prasina DSM 15648 (see SEQ ID NOs: 1 and 2). 

In a first aspect, the invention relates to an isolated polypeptide having protease 
35 activity, selected from the group consisting of: (a) a polypeptide having an amino acid 
sequence which has a degree of identity to amino adds 1 to 188 of SEQ ID NO: 2 of at least 
98.0%; {b) a polypeptide which is encoded by a nucleic acid sequence which hybridizes 
under medium-high stringency conditions with {i) nucleotides 486-1058 of SEQ ID NO; 1 (ii) a 
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subsequence of (i) of at least 100 nucleotides, and/or (iii) a compiementary strand of (i), (si), 
or (iii); (c) a variant of the poiypeptids having an amino acid sequence of amino acids 1 to 
188 of SEQ !D NO: 2 comprising a substitulion, deletion, extension, and/or Insertion of one or 
mors amino acids; (d) an aiielic variant of (a), (b) or (c); and (e) a fragment of (a), (b), (c), or 
5 (d) that tias protease activity. Ttie invention also relates to isolated nucleic acid sequences 
encoding sucii proteases; nucleic acid constructs, vectors, and host cells comprising ttie 
nucieic acid sequences; as well as methods for producing and using the proteases, in 
particular wifritn animal feed. 

In a second aspect the invention relates to: 

10 A. An isolated polypeptide having protease activity, selected from the group consisting 
of; (a) a polypeptide having an amino acid sequence which has a degree of identity to amino 
acids 1 to 188 of SEQ iD NO; 2 of at ieas^ 98%; (b) a polypeptide having an amino add 
sequence which has a degree of identity to amino acids -165 to 188 of SEQ ID NO; 2 of at 
least 96%; (c) a polypeptide which is encoded by a nucleic acid sequence which hybridizes 

15 under medium«hlgh stringency conditions with {i) DNA encoding a protease obtainable from 
genomic DNA from Nocardiopsis prasina DBM 15648 by use of primers SEQ ID NOS, 3 and 
4, (ii) nucleotides 496-1069 of SEQ ID NO: 1. (iii) nucleotides 1-1059 of SEQ ID NO: 1, (iv) a 
subsequence of (1) or (ii) or (Iii) of at least 100 nucleotides, or (v) a compiementafy strand of 
(t), (fj), {iii) or (Iv); (d) a variant of the polypeptide having an amino add sequence of amino 

20 acids 1 to 188, or -165 to 188 of SEQ !D NO: 2 comprising a substitution, deletion, extension, 
and/or insertion of one or more amino acids; (e) an alfelic variant of (a), (b) or (c); and (f) a 
fragment of (a), (b), (c), (d) or (e) that has protease activity; 

B, An isolated nucleic acid sequence comprising a nudeic acid sequence u'hich (a) 
encodes the polypeptide of A; (b) encodes a polypeptide having protease activity, and which 

25 hybridizes under medium-high stringency conditions with (i) DNA encoding a protease 
obtainable from genomic DNA from A/ocaraf/ops/s prasina DSM 1 5648 by use of primers SEQ 
iO NOS. 3 and 4, (Ii) nucleotides 4S6-1059 or 1-1059 of SEQ iD HO: 1, <iit) a subsequence of 
{() or ^ii) of at ieast 100 nucleotides, and/or (iv) a complementary strand of {i), (ii), or (iii); (c) 
ericodes a polypeptide having protease activity and which has a degree of identity to 

30 nucleotides 496-1 OSS SEQ ID HO: 1 of at least 97%; and/or (d) encodes a polypeptide 
having protease activity and which has a degree of Identity to nucleosides 1-1059 SEQ ID 
NO: 1 of at ieast 96%; 

C, An isolated nucieic acid sequence produced fay (a) hybridizing a DNA under medium- 
high stringency conditions vtfith (i) DNA encoding a protease obtainable from genomic DNA 

35 from Nocardiopsis pmsim DSM 15648 by use of primers SEQ ID NOS, 3 and 4; (ii) 
nucleotides 496-1059 or 1-1059 of SEQ ID NO: 1; (ill) a subsequence of (t) or (II) of at least 
100 nucleotides, or (iv) a complementary strand of (i), (ii) or (Iii); and (b) isoiating the nucieic 
acid sequence; 
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D, A nudefc add constaict comprising the nucleic add sequence of C operabiy linked to 
one or more controi sequences that direct the production of the poiypeptice in a suitable 
expression hcst; 

E, A recombinant expression vector compiising the nucleic acid construct of D; 

5 F. A recombinant host ceil comprising the nucleic acid construct of D or the vector of E; 

G. A niethod for producing a poiypeptide of A, the metiiod comprising: (a) cultivating a 
recombinant host ceii of H to produce a supernatant comprising the polypeptide; and (b) 
recovering the polypeptide; 

H. A transgenic piant, or piant part, capable of expressing the polypeptide of A; 

10 i. A transgenic, non-human anima!, or products, or eiements thereof, being capabie of 

expressing the polypeptide of A; 

J. Use of at ieast on© polypeptide as defined in A (i) in anima! feed; (ii) in the prepara- 
tion of a composition for use in anima! feed; (iii) for improving the nutritional value of an ani- 
mal feed; (iv) for increasing digestible and/or soluble protein in anima! diets; (v) for increasing 
15 tile degree of hydrolysis of proteir^s in anima! diets; and/or {vi) for the treatment of vegetable 
proteins; 

K. An animal feed additive comprising at least one poiypeptide as defined in A; and (a) 
at least one fat-soluble vitamin, and/or (b) at least one water-soluble vitamin, and/or (c) at 
least one trace mineral. 

20 t. An animal feed composition having a crude protein content of 50 to 800 g/kg and 
comprising at least one polypeptide as defined in A, or at least one feed additive of K; 
M. A composition comprising at least one polypeptide as defined in A, together with at 
ieast one other enzyme selected from amongst phytase (EC 3.1,3.8 or 3.1.3.26); xylanase 
{EC 3,2,1.8); galactanase (EC 3.2.1,89); alpha-galactosidase {EC 3,2,1.22); protease {EC 

25 3,4.-,-), phosphoiipase A1 (EC 3,1.1,32); phosphoiipase A2 (EC 3.1,1.4); iysophospholipase 
{EC 3.1,1,5); phosphoiipase C (3.1.4.3); phosphoiipase D (EC 3.1,4.4); and/or beta- 
glucanase {EC 3.2.1 .4 or EC 3.2.1 .6); 

H. Use of at least one polypeptide as defined in A in detergents. 

30 DETAILED DESCRIPTION OF THE INVENTION 

Polypeptides having protease activity, or proteases, are sometimes aiso designated 
peptidases, proteinases, peptide hydrolases, or proteolytic enzymes. Proteases may be of 
the exo-type M. hydroiyses peptides starting at either end thereof, or of the endo-type that 
act internally in polypeptide chains (endopeptldases). Endopeptidases show activity on N- 
35 and C-terminaliy blocked peptide substrates that are relevant for the specifidty of the prote- 
ase in question. 

The term "protease" is defined herein as an enzyme that hydroiyses peptide bonds. It 
inciudes any enzyme belonging to the EC 3.4 enzyme group (including each of the thirteen 
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subclasses thereof). The EC number refers to Enzyme Nomenciature 1992 from NC-iUBMB, 
Academic Press, San Diego, California, including supplements 1-5 pubiished in Eur. J- Bio- 
chem, 1994. 223, 1-5; Eur. J. Btochem. 1996. 232, 1-6; Eur. J. Biochem. 1996, 237. 1-5; Eur. 
J. Biochem. 1997, 250, 1-6; and Eur, J, Biochem. 1999, 264, 610-650; respectively. The no- 
5 menciature is regularly supplemented and updatsd; see e.g. the World Wide Web (WWW) at 
http://wvv'W.Qhem.qmw.ac.uk/!ybmb/enzyme/jndex.htmi), 

Proteases are classified on tite basis of their catalytic mechanism into the following 
groups: Serine proteases (S), Cysteine proteases (C), Aspartic proteases (A), Metalloprote- 
ases (M), and Unknown, or as yet unclassified, proteases (U), see Handbook of Proteolytic 
10 Enzymes, A.J.Barrett, N.D.Rawlings, J.F.Woessner (eds). Academic Press (t 998), in particu- 
iar Ihe general introduction part. 

In particular embodiments, the proteases of the invention and for use according to the 
invention are selected from the group consisting of: 
(a) proteases belonging to tfie EC 3.4.-.- enzyme group; 
1 5 (b) Serine proteases belonging to the S group of the above Handbook; 

(c) Serine proteases of peptidase family S2A; and/or 

(d) Serine proteases of peptidase family S1E as described in Blochem,J. 290:205-218 
(1993) and in MEROPS protease database, release 6.20, March 24, 2003, (www.me- 
rops.acuk). The database is described in Ravvlings, N.D., O'Brien, E. A. St Barrett, A.J. 

20 (2002) MEROPS: the protease database. Nucleic Acids Res. 30, 343-346. 

For determining whether a given protease is a Serine protease, and a family S2A pro- 
tease, reference is made to the above Handbook and the principles indicated therein. Such 
determination can be earned out for al! types of proteases, be it naturally occurring or wild- 
type proteases; or genetically engineered or synthetic proteases, 

25 Protease activity can be measured using any assay, in which a substrate is em- 

ployed, that includes peptide bonds relevant for the specificity of the protease in question. 
Assay-pH and assay-temperature are likewise to be adapted to the protease in question. Ex- 
amples of assay-pH-values are pH 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, or 12. Examples of assay- 
temperatures are 30, 35, 37, 40, 45, 50, 55, 60, 65, 70, 80, 90, or 95*C. 

30 E)samples of protease substrates are casein, such as Azurine-Crosslinked Casein 

(AZCL-casein), Two protease assays are desaribed in Example 2 herein, either of wtiich can 
be used to determine protease activity. For the purposes of this invention, the so-called pNA 
Assay is a preferred assay. 

There are no limitations on the origin of the protease of ttie invention and/or for use 

35 according to the invention. Thus, the terni protease includes not oniy natum! or wild-type pro- 
teases obtained from microorganisms of any genus, but also any mutants, variants, ftag- 
ments etc. thereof exhibiting protease activity, as well as synthetic proteases, such as shuf- 
fled proteases, and consensus proteases. Such genetically engineered proteases can be 
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prepared as is genera!!y known in the art, e.g. by Site-directed yutagenesis, by PGR (using 
a PGR fragment containing tiie desired mutation as one of the primers in the PCR reactions), 
or by Random Mutagenesis. The preparation of consensus proteins is described in e.g. EP 
897985. Gene shuffling is generaify described in e.g. WO 95/22625 and WO 96/00343. Re- 
5 combination of protease genes can be mada independently of the specific sequence of the 
parents by synthetic shuffling as described in Ness, J.E, et at, in Nature Biotechnoiogy, Vol. 
20 {12), pp. 1251-1255, 2002. Syntiietic ollgonucieotides degenerated in their DNA se- 
quence to provide the possibility of all amino acids found in the set of parent proteases are 
designed and the genes assembled according to tlie reference. The shuffling can be carried 

10 out for tlie full length sequence or for only part of the sequence and then later combined with 
the rest of the gene to give a fu!i length sequence. The protease of the mature peptide parts 
of SEQ ID NO: 2 and the proteases mentioned in the background art section herein are par- 
ticular examples of such parent proteases which can be subjected to shuffling as described 
above, to provide additional proteases of the invention. The term "obtained from" as used 

15 herein in connection with a given source shall mean that the polypeptide encoded by the nu- 
cleic acid sequence is produced by the source or by a cell in which the nucleic acid se- 
quence from the source Is present In a preferred embodiment, the polypeptide Is secreted 
extraceliulariy. 

In a specific embodin^ent, the protease is a iovv-aliergenic variant, designed to Invoke 

20 a reduced immunoiogical response when exposed to animals, including man. The term im- 
munological response is to be understood as any reaction by the immune system of an ani- 
mal exposed to the protease. One t^e of fmmunological response is an allergic response 
leading to increased levels of IgE in the exposed animal. Low-aHergenic variants may be 
prepared using techniques known in the art. For example the protease may be conjugated 

25 with poiymer moieties shielding portions or epitopes of the protease involved In an immu- 
nologica! response. Conjugation with polymers may involve in vSra chemical coupling of 
poiymer to the pnatease, e.g. as described in WO 96/17929, WO 98/30682, WO 98/35026, 
and/or WO 99/00489. Conjugation may In addition or alternatively thereto involve In vivo 
coupiing of polymers to the protease. Such conjugation may be achieved by genetic engl- 

30 neering of the nucleotide sequence encoding the protease, inserting consensus sequences 
encoding additional glycosylation sites In the protease and expressing the protease in a host 
capable of gii^sosyiating the protease, see e.g. WO 00/26354, Another way of providing low- 
aiier^enic vanants is genetic engineering of the nucleotide sequence encoding the protease 
so as to cause the protease to seif-oiigomerize, effecting that protease monomers may shield 

35 the epitopes of other protease monomers and thereby lowering the antigenicity of the oli- 
gomers. Such products and their preparation Is descHbed e.g. in WO 96/16177. Epitopes in- 
volved In an Immunologicai response may be identified by various methods such as the 
phage display method described in WO 00/26230 and WO 01/83569, or the random ap- 
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proach described in EP 581907, Once an epitope has been identified, its amino acid se- 
quence may be altered to produce altered immunologica! properties of the protease by 
known gene manipulation techniques such as site directed mutagenesis (see e.g. WO 
00/26230. WO 00/26354 and/or WO 00/22103) and/or conjugation of a polymer may be don© 
6 in suffldent proximity to the epitope for the polymer to shield tiie epitope. 

In a first embodiment, tlie invention relates to isolated polypeptides comprising an 
amino acid sequence which has a degree of identity to amino acids 1 to 188 of SEQ ID NO: 
2 (the mature peptide part) of at least about 98%, and which have protease activity (hereinaf- 
ter "homologous polypeptides"), in particular embodiments, the degree of identity to amino 
10 acids 1 to 188 of SEQ ID NO: 2 is at least about 99%. In still further particuiar embodiments, 
the degree of identity to amino acids 1 to 188 of SEQ !D NO: 2 is at ieast 97.4%, 97.6%, 
97.8%, 98,0%, 98.2%, 98,4%, 98,6%, 98.8%, 99.0%, 99.2%, 99.4%, 99.6%, or at ieast 
99.8%. 

The present invention also relates lo isoiated poiypepUdes comprising an amino acid 
15 sequence \i^fch has a degree of identity to amirio acids -165 to 1S8 of SEQ ID NO: 2 of at 
feast about 96%, and which have protease activity. !n particular embodiments, the degree of 
identity to amino acids -165 to 188 of SEQ ID NO: 2 is at least about 97%, 9S%, or at ieast 
99%. 

In particular embodiments, the polypeptides of the invention i) have; or if) consist of 

20 an amino acid sequence with any of the degrees of identity as mentioned above. 

For the purposes of the present invention, the degree of identity between two amino 
acid sequences, as vwsi! as the degree of identity between two nudeotide sequences, is 
determined by the program "align" which is a Needleman-Wunsch alignment (i.e. a gjobai 
alignment). The program is used for alignment of polypeptide, as well as nucleotide 

25 sequences. The default scoring matrix BLOSUMSO Is used for polypeptide alignments, and 
the default identity matrix Is used for nucleotide alignments. The penalty for the first residue 
of a gap is -12 for polypeptides and -18 for nudeotides. The penalties for further residues of 
a gap are -2 for polypeptides, and -4 for nudeotides. The mature peptide parts, or mature 
peptide encoding parts, respectively, are used for alignment of polypeptides, and 

30 polynucleotides, respectively. 

"Align" is part of tiie FASTA package version v20u6 (see W. R. Pearson and D. J. 
Lipman (1988), "improved Tools for Biological Sequence Analysis", PNAS 85:2444-2448, 
and W, R- Pearson (1690) "Rapid and Sensitive Sequence Comparison with FASTP and 
FASTA," Methods in Enzymology 183:63-98). FASTA prxiteln alignments use the Smith- 

35 Waterman aigorithm with no iimitation on gap size (see "Smith-Watemnan algorithm", T. F. 
Smith and M. S. Waterman (1981) J. Mol. Biol. 147:195-197). 

The present invention also relates to the animal feed use of the polypeptides of the 
invention. 
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!n a particular embodiment, t>e homologous polypeptides have an amino acid 
sequence that differs by ten> or by nine, or by eight, or by seven, or by six, or by five amino 
acids, in another parlicuiar embodiment, the homologous polypeptides differ by four, or by 
three, or by two amino acids, or by one amino add from amino acids 1 to 188 of SEQ ID NO: 
5 2 or -165 to 188 of SEQ iD NO: 2. 

In a particular embodiment, the polypeptides of the present invention comprise the 
amino acid sequence of amino acids 1 to 188, or -165 to 188 of SEQ iD NO: 2, or aOeiic 
variants tliereof; or fragments thereof that have protease activity. 

In another preferred embodiment, the polypeptides of the present invention consist of 
10 amino adds 1 to 188, or -165 to 188 of SEQ \D NO; 2, or alleiic variants thereof; or 
fragments thereof that have protease activity. 

A fragment of amino acids 1 to 188, or -165 to 188 of SEQ ID NO; 2, is a polypeptide 
having one or more amino adds deleted from the amino and/or carboxyl temiinus of these 
amino acid sequences. In one embodiment a fragment contains at ieast 75 amino acid 
15 residues, or at least 1 00 amino acid residues, or at least 125 amino acid residues, or at least 
150 amino acid residues, or at least 160 amino add residues, or at ieast 165 amino add 
residues, or at ieast 170 amino acid residues, or at least 175 amino acid resldues, 

An allelic variant denotes any of two or more alternative forms of a gene occupying 
the same chromosomal locus, Aiteiic variation arises naturaily through mutation, and may 
20 rBsuit in polymorphism within populations. Gene mutations can be silent (no change in the 
encoded polypeptide) or may encode polypeptides having altered amino acid sequences. An 
allelic variant of a polypeptide is a polypeptide encoded by an ailelic variant of a gene. 

The present invention also relates to isolated polypeptides having protease activity 
and which are encoded by nucleic acid sequences whidi hybridize under very low, or low, or 
25 medium, or medium-high, or high, or very high stringency conditions with a nudeic acid 
prabe which hybridizes under ihe same conditions with (a) nucleotides 499-1062 or 1-1062 
of SEQ ID NO: 1, (b) a subsequence of (a), or {c) a complementary strand of (a), or (b) (J. 
Sambrook, E.F. Frttsdh, and T. i\4aniatts, 1989, MoleoJlar Cloning, A laboratory Manual, 2nd 
edition. Cold Spring Harbor, New York), in one particular embodiment the nucleic add probe 
30 is selected from amongst ttie nucleic add sequences of (a), (b), or (c) above. Nudeotides 
496-1059 corresponding to the mature peptide encoding part of SEQ ID NO: 1 Is a preferred 
prc>be. 

The subsequence of nucleotides 496-1059 or 1-1059 of SEQ ID NO: 1 may be at 
least 100 nucleotides, or in another embodiment at feast 200 nudeotldes. Moreover, the 
35 subsequence may encode a polypeptide fragment that has protease activity. 

The nucleic acid sequences of nucleotides 406-1059 or 1-1059 of SEQ ID NO; 1 or a 
subsequence thereof, as well as the amino acid sequences of amino acids 1 to 188, or -165 
to 188 of SEQ ID NO: 2 or a fragment thereof, may be used to design a nudeic acid probe to 
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identify and clone DNA encoding polypeptides having protease activity from strains of 
different genera or species according to methods well known in the art. in particular, such 
probes can be used for iiybridizaOon with the genomtc or cDNA of the genus or species of 
interest, foiiowing standard Southern blotting procedures, in order to identify and isolate the 
5 corresponding gene therein. Such probes can be considerably shorter tlian tlie entire 
sequence, but shouid be at ieast 15, preferably at ieast 25, and more preferably at least 35 
nucleotides in iength. Longer probes can also be used. Both DNA and RNA probes can be 
used. The probes are typically labeled for detecting the corresponding gene (for exarripte, 
with '^^P, ^H, ^^S, btotin, oravidtn). Such probes are encompassed by the present inven^on, 

10 Thus, a genomtc DNA or cDNA librar/ prepared from such other organisms may be 

screened for DNA that hybridizes with the probes described above and which encodes a 
polypeptide having protease activity. Genomic or other DNA from such other organisms may 
be separated by agarose or polyacryiamide gei eiectrophorests, or other separation 
techniques. DNA fn:>m the libraries or the separated DNA may be transferred to and 

15 immobiiized on nitrocellulose or other suitable cannier material, in order to identify a clone or 
DNA which is homologous wi^ SEQ ID NO: 1 or a subsequence thereof, the carrier material 
is used In a Southern blot. For purposes of the present invention, hybridization indicates that 
the nucleic acid sequence hybridizes to a labeled nucieic acid probe corresponding to the 
nucleic acid sequence shown in SEQ ID NO: 1, its compiemenlary strand, or a subsequence 

20 tiiereof, undef very low to very high stringency conditions- Molecules to which the nucleic 
acid probe hybridizes under these conditions are detected using X-ray film. 

In a particular embodiment, fiie nucleic acid probe is a nudeic add sequence which 
encodes amino acids 1 to 188, or -165 to 188 of SEQ ID NO: 2, or subsequences thereof. In 
another embodiment, the nucleic add probe is nucleotides 496-1069, or 1-1059 of SEQ ID 

25 NO: 1, preferably nudeotides 496-1059 {the mature polypeptide coding region of SEQ ID 
NO: 1), fn another preferred embodiment, the nucleic acid probe is the nucleic add 
sequence, or preferably the mature polypeptide coding region thereof, which is obtainable 
from genomic DNA from Nocardiopsis prasina DSM 15648, wherein the nucleic add 
sequence encodes a polypeptide having protease activity. 

30 For long probes of at ieast 100 nucleotides In length, very low to very high stringency 

conditions are defined as prehybrldization and hybridization at 42''C in 5X SSPE, 0.3% SDS, 
200 f,ig/mi sheared and denatured salmon sperm DNA, and either 25% formamide for very 
low and low stringencies, 35% formamide for medium and medium-high stringencies, or 50% 
formamide for high and very high stringencies, foiiowing standard Southern biotting 

35 procedures. 

For long probes of at least 100 nucleotides in length, the carrier materia! is finally 
washed three times each for 15 minutes using 0.2 x SSC, 0.2% SDS, 20% formamide 
preferably at least at 45*C (very low stringency), more preferafaiy at ieast at SO°C (iow 
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stringency), more preferably at least at 55°C (medium stringency), more preferably at least at 
SO^C (medium-higti stringency), even more preferably at least at BS'C {high stringency), and 
nnost preferably at ieast at 70''C (very high stringency). 

For short probes about 15 nucleotides to about 70 nucleotides in iength, stringency 
5 conditions are defined as pretiybridizalion, hybridizatton, and wasfiing post-hybridization at 
5*C to WC below the calculated !„, using the caicuiation according to Bolton and McCarthy 
(1962, Proceedings of the Nationai Academy of Sciences USA 48:1390) in 0.9 M NaCi, 0,09 
IVJ Tris-HC! pH 7.6, 6 mU EDTA, 0.5% NP-40. IX Denhardfs soiutlon, 1 mM sodium 
pyrophosphate, 1 mlVl sodium monobasic phosphate, 0,1 mlVI ATP, and 0.2 mg of yeast RNA 

10 per mi following standard Southern biottlng procedures. 

For short probes about 15 nucleotides to about 70 nucleotides in length, the carrier 
materia! is washed once in 6X SSC plus 0.1% SDS for 15 minutes and twice each for 15 
minutes using 6X SSC at 5°C to 1 0'C beiow the calculated 7^^- 

The present invention also relates to variants of the polypeptide having an amino acid 

15 sequence of amino acids 1 to 188, or -165 to 188 of SEQ ID NO; 2, comprising a 
substitution, deletion, and/or insertion of one or more amino adds. 

The amino acid sequences of the variant polypeptides may differ from the amino acid 
sequence of amino adds 1 to 188 or -165 to 188 of SEQ iD NO: 2, by an insertion or deletion 
of one or more amino acid residues and/or the substitution of one or more amino add 

20 residues by different amino acid residues. In a particular embodiment, amino acid c^ianges 
are of a minor nature, that is conservative amino add substituttons tiiat do not significantly 
affect the folding and/or activity of the protein; small deletions, typically of one to about 30 
amino acids; small amino- or carboxyl-terminal extensions, such as an amino-termlnai 
methionine residue; a smali peptide of up to about 20-25 residues; or a small extension that 

25 facilitates purification by dianglng net diarge or another function, such as a poly-histldine 
tract, an antigenic epitope or a binding domain. 

Examples of conservative substitutions are within the group of basic amino adds 
(arginine, Ij^lne and histidtne), acidic amino acids (glutamic acid and aspartic acid), polar 
amino acids (giutamine and asparagine), hydrophobic amino adds (leucine, isoleucine and 

30 valine), aromatic amino acids (phenyialantne, tryptophan and t^ffosine), and small amino 
acids (glydne, alanine, serine, threonine and methionine). Accordingly, for example, the 
invention relates to a polypeptide having, or comprising, a sequence as set forth In SEQ ID 
NO: 2, preferably amino adds 1-188 of SEQ ID NO: 2, wherein conservative amino acid 
substituttons comprise replacements, one for another, among the basic amino acids 

35 {arginine, lysine and histidine), among the acidic amino adds (glutamic acid and aspartic 
acid), among the polar amino adds (giutamine and asparagine), among the hydrophobic 
amino acids (alanine, leucine, isoleucine, and valine), among the aromatic amino acids 
(phenylalanine, tryptophan and tyrosine), and among 9ie small amino adds (glydne, alanine, 
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serine, threonine and methionine), or any combination thereof, or active fragments thereof. 
Amino acid substitutions which do not generally alter the specific activity are known in the art 
and are described, for example, by H, Neurath and R.L Hill. 1979, In, The Proteins, 
Academic Press, New Yorl<. The most commonly occurring exchanges are Ala/Ser, Val/lle, 
5 Asp/Giu, Thr/Ser, Ala/Gly, Ala/Thr, Ser/Asn, AlaA/at, Ser/Gly, Tyr/Phe, AialPro, Lys/Arg, 
Asp/Asn, Leu/I!e, LeuA/ai, Ala/Glu, and Asp/Qiy as we!! as these in reverse. 

In a particular embodiment, the polypeptides of the invention and for use according lo 
the invention are acid-stabie. For the present purposes, the term acid-stab!e means that the 
residua! activity after 2 hours of incubation at pH 3.0 and 37°C, is at least 50%, as compared 
10 to the residual activity of a corresponding sample incubated for 2 hours at pH 9.0 and 5°C. In 
a particular embodiment the residua! activity is at least 60%, 70%, 80% or at least 90%. A 
suitable assay for detemfiining acid-stability is the pH-stability assay of Example 2. 

in another particular embodiment, the polypepftides of the invention and for use 
according to the invention have a relative activity at pH 7.0 of at least 0.10, 0.15, 0.20, 0.25, 
15 0.30 or at least 0.35, The pH-profile test of Example 2 is used for the determination, 

in sttii further particular embodiments, the polypeptides of the invention and for use 
according to the invention have i) a relative activity at 60'C and pH 9 of at least 0.06, 0.10, 
0.15 or at least 0.20; and/or ii) a relative activity at TO'C of at least 0.40, 0.50, or at least 
0.60. The temperature-profile test of Exampie 2 Is used for these determinations. 
20 The polypeptide of the invention and for use according to the invention may be a 

bacterial or fungal polypeptide. The fungal polypeptide may be derived from a fiiamentous 
fungus or from a yeast. 

in parUcular embodiments, the polypeptide of the invention is i) a batSerial protease; 
ii) a pn^tease of the phylum Actinobactem; iii) of the class Actinobact&m; iv) of the oftJIer 
25 AdinomyGei^ies v) of the family Nocardliopsaceae; vi) of the genus Nocardbpsis; and/or a 
protease derived from vii) NocardSopsis species sudh as Nocardiop^s aiba, Nocardiopsis 
afkaUphila, Nocardiopsis anfarcSca, Nocardiopsis composta, Nocardiopsis exhalans, 
Nocardiopsis hahphiia, Nocardiopsis haiotoierans, Nocardiopsis kunsanensis, Nocardiopsis 
ifstari, Nocardiopsis iucBntensis, Nocardiopsis metaUicus, Nocardiopsis synnematafomans, 
30 Nocardiopsis tmhafosi, Nocardiopsis tropics, Nocardiopsis umidischoiae, Nocardiopsis 
xinjiangen&s, Nocardiopsis dassonviiiei, Nocardiopsis pras/na, for example Nocardiopsis 
prasina DSU 15648, such as a polypeptide with the amino acid sequence of amino adds 1 to 
188, or -165 to 188, of SEQ ID NO: 2, in a particular embodiment, the protease derives from 
Nocardiopsis prama. 

35 The above ^xonomy is according to the chapter: The road map to the Manual by 

G.M. Garrity & J, G. Holt in Berge/s Manual of Systematic Bacteriology, 2001, second 
edition, volume 1, David R. Bone, Richard W. Castenhoiz. 
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tt wi!t be understood that for the aforementioned speciss, the invention encompasses 
both the perfect and imperfect states, and other taxonomic equivaients, B.g., anamorphs, 
regardiess of the species name by wtiich they are known. Those skilJed in the art wi!i readily 
recogntee the identity of appropriate equivalents. 
5 Strains of these species are readily accessible to tiie public in a number of culture 

collections, such as the American Type Culture Collection (ATCC), Deutsche Sammlung von 
yikroorganismen und Zeiikulturen GmbH (DSM), Centraaibureau Voor Schimmeicuitures 
(CBS), and Agricultural Research Service Patent Culture Collection, Northern Regional 
Research Center (NRRL). E.g., Nocardiopsis pmsina DSM 15648 is publicly available from 
10 DSMZ (Deutsche Sammlung von Mikroorgantsmen und Zeiikuituren GmbH, Braunschweig, 
Germany), 

Furthermore, such polypeptides may be identified and obtained from other sources 
including microorganisms isolated from nature (e.g., soil, composts, water, etc.) using the 
above-mentioned probes. Techniques for isolating microorganisms from natural habitats are 

15 well known in the art. The nucleic acid sequence may then be derived by similarly screening 
a genomic or cDNA library of another microorganism. Once a nucleic acid sequence 
encoding a polypeptide has been detected with the probeCs), the sequence may be isolated 
or cloned by utilizing tedmiques which are known to those of ordinary ski in the art (see, 
e.g., Sambrook et al„ 1989, supra). 

20 As defined iierein, an "isolated" polypeptide Is a polypeptide which is essentially free 

of other polypeptides, e.g., at ieaai about 20% pure, preferably at least about 40% pure, 
more preferably about 60% pure, even more preferably about 80% pure, most preferably 
about 90% pure, and even most preferably about 95% pure, as determined by SDS~PAGE. 
Polypeptides encoded by nucleic acid sequences of the present invention also 

25 inciude fused polypeptides or deavable fusion polypeptides in whidi another polypeptide is 
fused at the N-termlnus or the C-terminus of the polypeptide or fragment thereof, A fused 
polypeptide is produced by fusing a nucleic acid sequence (or a portion thereof) encoding 
another polypeptide to a nucleic add sequence (or a portion thereof) of the present invention. 
Techniques for producing fusion polypeptides are known in the art, e.g. PGR, or iigating the 

30 coding sequences encoding the polypeptides so that they are in frame and that expression of 
the fused polypeptide is under control of the same promoter(s) and terminator. 

In still further particular embodiments, the invention exdudes one or more of the 
proteases derived from (i) Nocardiopsis daoscmviHei NRRL 18133 v^^hich is disdosed in WO 
88/03947; (ii) Nocafxiiopsis sp. strain OPC-210 (PERM P-10608) which is disdosed in JP 2- 

35 256081 -A; (iil) strain Zil\^ET 43647 of the species Nocardiopsis dassonviliei which is 
disdosed in DD 200432)8; (iv) Nocardiopsis sp. TOA-1 {FERiVl-P-1867e), which is disdosed 
in JP 2003284571; and/or (v) the corresponding DNA. 

Nucleic Acid Sequences 

11 
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The present invention also relates to isolated nudeic acid sequences that encode a 
polypeptide of the present invention. Particular nucleic acid sequences of the invention are 
nucleotides 496-1059, or 1-1059, of SEQ ID NO: 1, the former corresponding to tiie mature 
polypeptide encoding region. Another particular nucleic acid sequence of the invention is the 
5 sequence, preferably the mature poiypeptide encoding region thereof, which is obtainable 
from genomic DNA from Nocardhpsis prasina DSM 15648. The present invention also 
encompasses nucleic acid sequences which encode a polypeptide having the amino add 
sequence of amino acids 1 to 188, or -165 to 188, of SEQ ID NO: 2, which differ from the 
corresponding parts of SEQ ID NO; 1 by virtue of the degeneracy of the genetic code. The 
10 present invention also relates to subsequences of SEQ ID NO; 1 which encode fragments of 
SEQ ID NO: 2 that have protease activity. 

A subsequence of SEQ ID NO: 1 is a nucietc acid sequence encompassed by SEQ 
ID NO: 1 except that one or more nucleotides from the 5' and/or 3' end has been deieted. 
Preferabiy, a subsequence contains at least 225 nucleotides, more preferably at least 300 
15 nucleotides, even more pr^eratty at least 375, 450, 500, 531, 600, 700, 800, 900 or 1000 
nucieotides. 

The present invention also relates to nucleotide sequences which have a degree of 
Identity to nucleotides 496-1059 of SEQ ID NO; 1 of at ieast 97%, or a degree of identity to 
nucleotides 1-1059 of SEQ ID NO: 1 of at ieast 96%. In particular embodiments, the degree 
20 of identity to nucieotides 496-1 059 of SEQ ID NO; 1 is at least 96.6%, 96.8%, 97.0%, 97.2%, 
97.8%, 98.0%, 98.2%, 98.4%, 98.6%, 98.8%, 99.0%, 99.2%, 99.4%, 99.6%, or at (east 
99.8%. 

The present invention also relates to nucleotide sequences which have a degree of 
Identity to nucleotides 496-1059 of SEQ ID NO: 1 of at least 97%, or a degree of Identity to 

25 nucleotides 1-1059 of SEQ ID NO: 1 of at least 96%. In particular embodiments, the degree 
of idenljty is to nucleotides 496-1059 of SEQ fD NO: 1 is at least 98, or at least 99%. In other 
particular embodiments, tlie degree of Identity to nucleotides 1-1059 of SEQ ID NO: 1 is at 
least 97, 98. or at least 99%. 

The present Invention also relates to mutant nucleic acad sequences comprising at 

30 least one mutation in nucleotides 496-1059, or 1-1059, of SEQ ID NO; 1 , in whidn the mutant 
nucleic add sequence encodes a polypeptide vMdh (i) consists of amino acids 1 to 188, or - 
165 to 188 of SEQ ID NO: 2, or (ii) is a variant of any of the sequences of {i}, wherein the 
variant comprises a substitution, deletion, and/or insertion of one or more amino acids, or (ill) 
Is an allelic variant of any of the sequences of (i), or (iv) is a fragment of any of the 

35 sequences of (i). 

The techniques used to isolate or done a nudeic add sequence encoding a 
polypeptide are known in the art and include isolation from genomic DNA, preparation from 
cDNA, or a combination thereof. The cloning of tlie nucleic add sequences of the present 
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invention from such genomic DNA can be effected, e.g., by using the well l<nown polymerase 
chain reaction (PGR) or antibody so'sening of expression libraries to detect doned DNA 
fragments with shared structural features. S&s, e.g., Innis et ai., 1990, PGR: A Guide to 
yetiiods and Appiicatton, Academic Press, New York. Other nucleic acid amplification 
5 procedures such as Sigase chain reaction {LCR), ligated activated transcription (LAX) and 
nucieic acid sequence-based amplification (NASBA) may be used. The nucleic acid 
sequence may be cioned from a strain of NocBrdiopsis or another or related organism and 
thus, for example, may be an alielic or species variant of the poiypeptide encoding region of 
the nucleic acid sequence. 

10 The tenTi "isoiated nucleic acid sequence" as used herein refers to a nucleic acid 

sequence which is essentiaily free of other nucieic acid sequences, e.g., at least about 20% 
pure, preferably at least about 40% pure, more preferably at least about 60% pure, even 
more preferably at least about 80% pure, and most preferably at least about 90% pure as 
determined by agarose eiectrophoresis. For example, an isolated nucleic add sequence can 

15 be obtained by standard cloning procedures used in genetic engineering to relocate the 
nucleic add sequence from its natural location to a different site where it will be reproduced. 
The cloning procedures may involve excision and isolation of a desired nucieic add fragment 
comprising the nucleic acid sequence encoding the polypeptide, insertion of the fragment 
into a vector molecule, and incorporation of the recombinant vector Into a host cell where 

20 multiple copies or ciones of the nucieic acid sequence wil! be replicated. The nudeic acid 
sequence may be of genomic, cDNA, RNA, semisynthetic, synthetic origin, or any 
combinations thereof. 

Modification of a nudeic acid sequence encoding a poiypeptide of the present 
inven^on may b© necessary for the synthesis of polypeptides substentiaily simifar to the 

25 polypeptide. The term "substantiaily similar" to the poiypeptide refers to non-natura!ly 
occurring fomis of the polypeptide. These polypeptides may differ In some engineered way 
from the poiypeptide isolated from its native source, e.g., \^ants that dPer in spedfic 
activity, thermostability, pH optimum, allergenicity, or the like. The variant sequence may be 
constructed on the basis of the nucieic add sequence presented as the polypeptide encoding 

30 part of SEQ ID NO: 1, e.g., a subsequence thereof, and/or by introduction of nucleotide 
substitutions v\^iich do not give rise to another amino acid sequence of the polypeptide 
encoded by the nucleic add sequence, but which correspond to fiie codon usage of the host 
oiiganism intended for production of the protease, or by introduction of nucleotide 
substitutions which may give rise to a different amino acid sequence. For a genera! 

35 description of nucleotide substitution, see, e.g., Ford at a!., 1991, Protein Expression and 
Purification 2: 95-107, Low-allergenic polypeptides can e.g. be prepared as described above. 
It will be apparent to those skilled in the art that such substitutions can be made 
outside the regions critica! to the function of the molecule and stiii result in an active 
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pofypeplide. Amino acid residues essential to the activity of the polypeptide encoded by the 
isolated nucleic acid sequence of the invention, and therefore preferably not subject to 
substitution, may be identified according to procedures known In the art, such as site- 
directed mutagenesis or afanine-scannlng mLftagenesis (see, e.g., Cunningham and Weils, 
5 1989, Science 244: 1081-1085). In the latter technique, mutations are introduced at every 
positively charged residue in the moiecuSe, and the resultant mutant molecules are tested for 
protease activity to identify amino acid residues ttiat are critical to the activity of the molecule. 
Sites of substrate-protease interaction can also be determined by analysis of the three- 
dimensional structure as determined by such techniques as nuclear magnetic resonance 

10 analysis, aystaliography or photoafftnity labelling (see, e.g., de Vos et al., 1992, Science 
255: 306-312; Smith et al., 1992, Journal of Moiecular Biology 224: 809-904; Wlodaver et al, 
1992, FEBS Letters 309: 59-64). 

The present invention also relates to isolated nucleic acid sequences encoding a 
polypeptide of the present invention, v\^lch hybridize under very low stringency conditions, 

15 preferably low stringency conditions, more preferably madium stringency conditions, more 
preferably medium-high stiingsncy conditions, even more preferably high stringency 
conditions, and most preferably very high stiingency conditions with a nucleic acid probe 
whicii hybridizes under Uie same conditions with the nucleic acid sequence of SEQ ID NO: 1 
or its complementary strand; or allelic variants and subsequences thereof {Sambrx3ol< et a!., 

20 1 989, supra), as defined herein. 

The present invention also relates to Isolated nucleic acid sequences produced by (a) 
hybridizing a DNA under very iow, low, medium, medium-high, high, or very high stringency 
conditions with (I) nucleotides nucleotides 496-1059, or 1-1059, of SEQ ID NO: 1, (li) a 
subsequence of (I), or (iii) a complementary strand of {!), or (li); and (b) isolating the nucleic 

25 acid sequence. The subsequence is preferably a sequence of at least 100 nucleotides such 
as a sequence that encodes a polypeptide fragment which has protease activity. 

i^ethods for Producing l\Sutant Nucieic Acid Sequences 

The present invention further relates to methods for producing a mutant nucleic add 
30 sequence, comprising Introdudng at least one mutetion into the mature polypeptide coding 
sequence of SEQ \D NO; 1 or a subsequence thereof, wherein the mutant nucleic acid 
sequence encodes a poi^eptide which consists of amino acids 1 to 188, or -165 to 186, of 
SEQ ID NO: 2; or a fragment thereof which has protease activity. 

The introduction of a mutation into ^e nucleic acid sequence to exchange one 
35 nucleotide for another nudeotide may be accomplished by site-directed mutagenesis using 
any of the methods Icnown in the art. Particularly useful is the procedure that utiilzes a 
supercoiied, double stranded DNA vector with an insert of interest and two synthetic primers 
containing the desired mutation. The oligonucleotide primers, each complementary to 
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opposite Strands of the vector, extend diifing temperature cyciing by means of Pfu DNA 
polymerase. On incorporation of the primers, a mutated plasmid containing staggered nicks 
is generated. Following temperature cycling, tfie product is treated with Dpni which is specific 
for methylated and hemimethyiated DNA to digest the parental DNA template and to seiect 
5 for mutation-containing synthesized DNA. Other procedures known in the art may also be 
used. 



NuGieic Acid Constructs 

The present invention also relates to nucleic acid constructs comprising a nucleic acid 

10 sequence of the present invention operabiy linked to one or more control sequences that 
direct the expression of the coding sequence in a suitable host eel! under conditions 
compatible with the control sequences. Expression will be understood to include any step 
involved in tlie production of the polypeptide including, but not limited to, transcription, ppst- 
transcriptfonal modification, transiation, post-transiational modification, and secretion. 

15 "fslucleic acid construcr is defined herein as a nucleic acid molecule, either singie- or 

double-stranded, which is isolated from a naturaiiy occurring gene or which has been 
modified to contain segments of nucietc acid combined and Juxtaposed In a manner that 
would not otherwise exist in nature. The term nucleic acid construct is synonymous with the 
term expression cassette when the nucleic acid cons^ct contains ail the control sequences 

20 required for expression of a coding sequence of the present invention. The term "coding 
sequence" is defined herein as a nucleic acid sequence that directly specifies the amino add 
sequence of its protein product. The boundaries of the coding sequence are generaliy 
determined by a ribosome binding site (proioryotes) or by the ATG start codon (eukaryotes) 
located just upstream of the open reading frame at the 5' end of the mRNA and a 

25 frianscription terminator sequence located just dowistream of tf)e open reading frame at the 
3' end of the mRNA. A coding sequence can include, but is not limited to, DNA, cDNA, and 
recombinant nudeic acid sequences. 

Ar^ isoiated nudeic acid sequence encoding a polypeptide of the present inven^on 
may be manipulated In a variety of ways to provide for ejqDression of the polypeptide. 

30 Manipulation of the nucietc add sequence prior to Its insertion into a vector may be desirable 
or necessary depending on the expression vector. The techniques for modifying nucleic add 
sequences utilizing recombinant DNA methods are well known in the art. 

The imt\ "control sequences" is defined herein to tnciude all components that are 
necessary or advantageous for the expression of a polypeptide of fiie present invention. 

3S Eac*i control sequence may be native or foreign to the nudeic acid sequence encoding the 
polypeptide. Such control sequences include, but are not limited to, a ieader, potyadenyiation 
sequence, propeptide sequence, promoter, signal peptide sequence, and transcription 
temiinaior. At a minimum, the controi sequences tnciude a promoter, and transcriptional and 
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translational stop signals. The control sequences may be provided with linkers for the 
puipose of introducing specific restriction sites facilitating ligation of the contro! sequences 
with the coding region of thB nucleic acid sequence encoding a polypeptide. The term 
"operabiy iinked" is defined iienein as a configuration in wiiich a control sequence Is 
5 appropriately placed at a position relative to the coding sequence of the DNA sequence such 
tiiat the control sequence directs the expression of a polypeptide, 

Ihe control sequence may be an appropriate promoter sequence, a nucleic add 
sequence that is recognized by a host cell for expression of the nucleic acid sequence. The 
promoter sequence contains transcriptional confj'ol sequences that mediate the expression of 

10 the polypeptide. The promoter may be any nucleic acid sequence which shows 
transcriptional activity in the host cell of choice including mutant, truncated, and hybrid 
promoters, and may be obtained from genes encoding extraceHuiar or intracellular 
poiypeptides either homoiogous or heteroiogous to the host cell. 

Examples of suitable promoters for directing ihe transaiption of the nucleic acid 

15 constructs of the present invention, espedaliy in a bacterial host cell, are the promoters 
obtained fi-om the E cod lac operon. Streptomyces coeiicolor agarase gene (dagA), Bacillm 
subtHis le\«nsucrase gene (sacB), Bacillus lichenifomis aipha-amylase gene (amyL), 
Baciilus stearoUiermophilus maltogenic amylase gene (amyM), Bsdllus ainyloliqu&faciens 
aipha-amylase gene (amyQ), Bacillus licheniformis peniclliinase gene t'penP), Bacillus 

20 subtitis xyiA and xylB genes, and prokaryotic beta-lactamase gene (Viiia-Kamaroff et al., 
1978, Proceedings of the National Academy of Sciences USA 76; 3727-3731), as well as the 
tac promoter (DeBoer et a!., 1983, Proceedings of the National Academy of Sciences USA 
30: 21 -2S). Further promoters are described in "Useful proteins from recombinant bacteria" in 
Scientific American, 1980, 242; 74-94; and In Sambrook et al., 1989, supra. 

25 Examples of suitable promoters for directing the transo-iptlon of the nucleic acid 

constructs of the present invention in a filamentous fungai host ceB are promoters obtained 
from the genes for Aspergiilus oryzae TAKA amylase, Rhizcmucor miehei aspartic 
proteinase, Aspergillus niger neutral aipha-amylase, Aspergillus niger acid stabie aipha- 
amylase, AspergSlius niger or Aspergillus awamon' giucoamylase (gtaA), Rhizomucor miehei 

30 lipase, Asp&rgilfus oryzae alkaline protease, Asp^Hlus oryzsB triose phosphate (somerase, 
Aspergiiius riidulans acetamidase, and Fu^rium oxysporam trypsin-like protease {WO 
96/00787), as we!! as the NA2-tpi promoter (a hybrid of the promoters from the genes for 
Aspergiilus nigsr neutral aipha-amylase and Aspergiiius oryzae triose phosphate isomerase), 
and mutant, truncated, and hybrid promoters thereof. 

35 In a yeast host, useful promoters are obtained from the genes for Sacoharomyc&s 

mfBvisiae enoiase (EN0~1), Saoc/js/iomyces GBt&vlsiae galactokinase (GAL1), 
Saccharomyces cerews/ae alcohol dehydrogenase/glyceraldehyde-S-phosphate 
dehydrogenase (ADH2/GAP), and Sacctiaromyces Ger&miae 3-phosphog!ycerat6 kinase. 
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Other useful promoted for yeast host ceils are described by Romafios &t a/., 1992, Yeast 8: 
423-488. 

The contro! sequence may also be a suitable transcription terminator sequence, a 
sequence recognized by a fiost ceil to terminate transcription. The terminator sequence is 
5 operably iinked to the 3' terminus of the nucleic acid sequence encoding tiie poiypeptide. 
Any terminator which is functionai in the twst ceil of choice may be used in the present 
invention. 

Preferred temiinators for filamentous fungal host ceiis are obtained from the genes 
for Aspergillus oryzae TAKA amyiase, AsperyiHus niger glucoamylase, Asp&rgilius nidul&ns 
10 anthranilate synthase, Aspergillus ii/ger alpha-glucosidase, and Fusarium oxyspoivm trypsin- 
like protease. 

Preferred terminators for yeast host ceiis are obtained from the genes for 
SaccharomyoBS cerevisiae enolase, Saccharomyces cemvisiae cytochrome C {CYC1 ), and 
Saccharomyces cerevisiae g!ycera!dehyd6-3-phosphate dehydrogenase. Other useful 

1 5 terminators for yeast host cefis are described by Romanes et ai,, 1 892, supra. 

Preferred terminators for bacteria! host ceils, such as a Bacillus host ceil, are the 
terminators from BbcUIus licheniformis aipha-amylase gene (amyl), the Bacillus 
stoarotfiBrmophiius maltogentc amylase gerie (amyM), or the Badlius amyioliquefaciens 
aipha-amylase gene (amyQ). 

20 Tlie controi sequence may also be a suitable leader sequence, a nontransiated 

region of an mRNA which is important for transiation by the host cell. The leader sequence is 
operably linked to the 5' terminus of the nucleic acid sequence encoding the poiypeptide. 
Any leader sequence that is functional in the host celi of choice may be used in the present 
invention. 

25 Prefeired leaders for filamentous fungai host cells are obtained from the genes for 

A^Brgillus oryzae TAKA amyiase and Aspergiitus niduians triose phosphate isomerase. 

Suitable leaders for yeast host cells are obtained from the genes for Saccftaramyces 
cerevisiae enoiase {EiviO-1), Saccharcmyces cere}risf8e 3-phosphogiycerate kinase, 
Sacdiaromyces cerevisiae alpha-factor, and Saccharomyces oer&vlsiae alcohol 
30 dehydrogenase/glyceraldehyde-S-phosphate dehydrogenase {ADH2/GAP). 

The controi sequence may also be a polyadenylation sequence, a sequence operably 
linked to the 3' terniinus of the nucleic acid sequence and which, when transcribed, is 
recognized by the host ce!i as a signal to add polyadenoslne residues to transcribed mRNA. 
Any poiyadenylation sequence which is functionai in the host cell of choice may be used in 
35 the present invention. 

Preferred poiyadenylation sequences for filamentous fungal host ceiis are obtained 
from the genes for Aspergiilus oryzae TAKA amyiase, Aspergillus niger giucoamylasa, 
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Aspergillus nidulans anthraniiate synthase, Fusarium oxysporum trypsin-like protease, and 
Aspergillus niger aipha-glucosidase. 

Useful polyadenyiation sequences for yeast host cefis are described by Guo and 
Sherman, 199S, Moiecuiar Ceiluiar Biology 15: 5983-5990. 
5 The controi sequence may also be a signal peptide coding region fiat codes for an 

amino acid sequence linked to iiie amino terminus of a polypeptide and directs the encoded 
polypeptide into the ceii's secretory pathway. The 5' end of [he coding sequence of the 
nucleic acid sequence may inherently contain a signal peptide coding region naturally i)nS<ed 
in translation reading frame with the segment of the coding region which encodes the 

10 secreted polypeptide. Alternatively, the 5' end of the coding sequence may contain a signal 
peptide coding region which is foreign to the coding sequence. The foreign signal peptide 
coding region may be required y#iere the coding sequence does not naturally contain a 
signal peptide coding region. Alternatively, the foreign signal peptide coding region may 
simply replace the natural signai peptide coding region in order to enhance secretion of the 

15 polypeptide. However, any signal peptide coding region which directs the expressed 
polypeptide into the secretory pathway of a host ceJi of choice may be used in the present 
invention. 

Effective signal peptide coding regions for bacterial host cells are the signal peptide 
coding regions obtained from the genes for Bacillus NCIB 11837 maitogenic amylase. 

20 Baciilus stearothemophilus alpha-amylase, Bacillus tichenifaimis subtiiisin, Bacitlus 
licheniformis alpha-amylase, Bacillus stQarothBtmophiius neutral proteases (nprT, nprS, 
npriVI), and Bacillus subtilis prsA. Further signal peptides are described by Simonen and 
Pafva, 1993, IVlicrobioiogical Reviews 57: 109-137, 

Effective signai peptide coding regions forfiiamentous funga! host ce!!s are the signai 

25 peptide coding regions obtained from the genes for Aspergillus oryza& TAKA amylase, 
Aspergillus niger neutral amylase, Aspergillus niger glucoamylase, Rhizomucor miehei 
aspartic proteinase, Humicola insoi&ns celluiase, and HumicoiB lanuginosa lipase. 

Useful signal peptides for yeast host ceils are obtained from the genes for 
Saccharomyoes oerevi^e alpha-factor and Sacctistromyces cerevisiae Invertase. Other 

30 useful signai peptide coding regions are described by Romanos et af., 1992, supra. 

The conftt)! sequence may aiso be a propeptide coding region that codes for an 
amino acid sequence positioned at the amino terminus of a polypeptide. "Tlie resultant 
polypeptide Is known as a pnsenzyme or prqpolypeptide (or a zymogen in some cases). A 
propoiypeptfde is generally inactive and can be converted to a mature active polypeptide by 

36 catalytic or autocatalytic cleavage of Uie propeptide from the propolypeptide. The propeptide 
coding region may be obteined from the genes for Bacillus subttlis ailtaline protease (aprE), 
Bacillus SI/M//S neutral protease (nprT), Sacohammyces cerevisiae alpha-factor, Rhizomucor 
m/efte/aspartic proteinase, and MycefiopMiora ihermophlla laccas© (WO 95/33836), 
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In a preferred embodiment, the propeptide coding region is nucleotides 1-495 of SEQ 
ID NO: 1 whici^ encodes amino acids -165 to -1 of SEQ ID NO: 2. 

Where both signaf peptide and propeptide regions are present at tiie amino terminus 
of a polypeptide, tiie propeptide region Is positioned next to the amino terminus of a 
5 poiypeptide and the signal peptide region is positioned next to the amino terminus of the 
propeptide region. 

it may also be desirable to add regulatory sequences which allow the reguiatlon of 
the expression of the polypeptide relative to the growth of the host cell. Examples of 
regulatory systems are those which cause th& expression of the gene to be turned on or off 

10 in response to a chemical or physical stimulus, including the presence of a regulatory 
compound. Regulatory systems In prokaryotic systems include the lac, tac, and trp operator 
systems. In yeast, the ADH2 system or GAL1 system may be used, !n filamentous fungi, tiie 
TAKA alpha-amylase promoter, AspergiHus niger glucoamylase promoter, and Aspergillus 
oryiae glucoamylase promoter may be used as regulatory sequences. Other examples of 

15 regulatory sequences are those which allow for gene amplification. In eukaryotic systems, 
these indude the dihydrofoJate reductase gene which is amplified in the presence of 
methotrexate, and the metaliothionein genes which are amplified with heavy metals. In these 
cases, the nucleic acid sequence encoding the poiypeptide wouid be operabiy linked with the 
regulatory sequence, 

20 

Expression Vectors 

The present invention also relates to recombinant expression vectors comprising a 
nucleic acid sequence of the present invention, a promoter, and transcriptional and 
faransiationa! stop signals. The various nucieic acid and control sequences described above 

25 may be Joined together to produce a recombinant expression vector which may include one 
or more convenient restriction sites to allow for insertion or substitution of the nudeic acid 
sequence encoding the polypeptide at such sites. Alternatively, the nucleic acid sequence of 
the present Invention may be expressed by inserting the nudeic acid sequence or a nucieic 
acid constmct comprising the sequence into an appropriate vector for ei^resslon. In creating 

30 the expression vector, the coding sequence is iocated in the vector so that the coding 
sequence is operabiy Jinked with the appropriate a>ntrol sequences for expression. 

The recombinant expression vector may be any vector {e.g., a piasmid or virus) which 
can be conveniently subjected to recombinant DNA pro(^dures and can bring about the 
expression of the nucleic acid sequence. The choice of the vector wjli typically depend on the 

35 compatibiiity of the vector with the host ceii into which the vector is to be introduced. The 
vectors may be linear or closed circuiar plasmids. 

The vector may be an autonomously replicating vector, i.e., a vector which exists as 
an extrachromosomai entity, the replication of which is independent of diromosomai 
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repiication, e.g., a piasmfd, an extrachromosomaS element, a minichromosome, or an artificial 
chromosome. The vector may contain any means for assuring seif-repiication. Aiternatively, 
the vector may be ons which, when introduced into the host celS, is integrated into the 
genome and replicated together with the chromosome(s) into which (t has been integrated. 
5 Furthemiore, a siogie vector or plasmid or two or more vectors or piasmids which together 
contain the total DMA to be introduced into the genome of the host ce!i, or a transposon may 
be used. 

The vectors of the present invention preferably contain one or more seiectabie 
markers which permit easy selection of transfomied ceiis. A seiectabie marker is a gene the 

10 product of which provides for biocide or viral resistance, resistance to heavy metals, 
prototrophy to auxotrophs, and the !i!<e. Examples of bacteria! selectable mariners are the dal 
genes from Bacillus subtHis or Bacillus lictieniformis. Suitable matters for yeast host cetis are 
ADE2, HIS3, LEU2, LYS2, IV4ET3, TRP1, and URA3. Seiectabie markers for use in a 
fiiamentous fungai host ceil include, but are not limited to, amdS (acetamidase), argB 

15 (ornithine carbamoyltransferase), bar (phosphtnothricin acetyltransferase), hygB {hygromycih 
phosphotransferase), niaD (nitrate reductase), pyrG (orot!dine-5'-phosphate decarboxyiase), 
sC (.'iiilfafe adenyitransferase). trpC (anthraniiate synthase), as weii as equivaients thereof. 
PrefeTOd for use in an Aspergillus cell are the amdS and pyrG genes of Aspeigillus nidulans 
or Aspeiyillus oryzae and the bar gene of Streptomyces hygroscopicus. 

20 The vectors of the present invention preferably contain an element{s) that permits 

stable integration of the vector into the host ceifs genome or autonomous repfication of the 
vector in the ceii independent cf the genome. 

For integration into iihe host celi genome, the vector may rely on the nucleic add 
sequence encoding the poiypeptide or any other element of the vector for stabie integration 

25 of the vector into the gencst^e by homoiogous or nonhomoiogous recombination. 
Alternatively, the vector may contain additional nucleic acid sequences for directing 
integration by homologous recombination into the genome of the host cell. The additional 
nucleic acid sequences enable the vector to be integrated into the host ceif genome at a 
precise iocation{s) in the chromosom6{s). To increase the liiceiihood of integration at a 

30 precise location, the integratlonal elements should preferably contain a sufficient number of 
nucleic acids, sudi as 100 to 1,600 base pairs, preferably 400 to 1,500 base pairs, and most 
preferably SCO to 1,500 base pairs, which are highly homologous with the corresponding 
target sequence to enhance the probability of homoiogous recombination. The integrational 
elements may be any sequence that is homologous with the target sequence in the genome 

35 of the host ceii. Furthemiore, the integrational eiements may be non-encoding or encoding 
nucleic acid sequences. On the other hand, the vector may be integrated into the genome of 
the host celi by non-homologous recombination. 
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For autonomous replication, the vector may further comprise an origin of replication 
enabling the vector to replicate autonomously in the host cell sn question. Examples of 
bacteriai origins of replication are the origins of repiication of plasmids pBR322, pUC19, 
pACYCirr, and pACYC184 permitting replication in E co//, and pUBliO, pE194, pTA1060, 
5 and pAMfJI permitting replication in Baciilus. Examples of origins of repiication for use in a 
yeast host eel! are the 2 micron origin of repiication, ARS1, ARS4, the combination of ARS1 
and CEN3, and the combination of ARS4 and CEN6. The origin of repiication may be one 
having a mutation which makes its functioning temperature-sensitive in the host ceil (see, 
e.g,, Ehrlich, 1978, Proceedings of the National Academy of Sciences USA 75: 1433). 

10 yore than one copy of a nucleic acid sequence of the present invention may be 

inserted into the host celi to inaease production of the gene product. An increase in the copy 
number of the nucleic acid sequence can be obtained by integrating at least one additional 
copy of the sequence into the host eel! genome or by including an ampiifiabie selectable 
marker gene witli the nucleic acid sequence where ceiis containing amplified copies of the 

16 selectable marker gene, and thereby additional copies of the nucleic acid sequence, can be 
selected for by cultivating the ceils in the presence of the appropriate selectable agent. 

The procedures used to iigate the elements desaibed above to construct the 
recombinant expression vectors of the present invention are weii known to one skiifed in the 
art {see, e.g., Sambrook et al.. 1 989, supra). 

20 The protease may also be co-expressed together with at least one other enzyme of 

interest for animal feed, such as phytase (EC 3.1.3,8 or 3.1,3.26); xyianase (EC 3.2.1.8); 
galactanase (EC 3.2.1,89); alpha-gaiactosidase (EC 3,2,1.22); protease (EC 3,4,-,-), 
phospholipase Al (EC 3.1.1.32); phospholipase A2 (EC 3,1.14); iysophospholipase (EC 
3,1.1.5); phospholipase C (3,1.4,3); piiospholipase D (EC 3.1,4,4); am^ase, for example 

25 aipha-amylase (E.C. 3.2.1 . 1 ); and/or beta-g!ucanase (EC 3,2. 1 ,4 or EC 3.2.1 ,6). 

The enz^es may be co-expressed from different vectors, from one vector, or using a 
mixture of both techniques. When using different vectors, the vectors may have different 
selectable markers, and different origins of replication. When using only one vector, the 
genes can be expressed from one or more promotere. If doned under the regulation of one 

30 promoter (di- or muKi-cistronic), the order in wliich the genes are cloned may affect the 
expression levels of the proteins. The protease may also be es^ressed as a fusion protein, 
j,e. that the gene encoding the protease has been fused In frame to the gene encoding 
another protein. This protein may be another enzyme or a functional domain from another 
enzyme, 

35 

Host Cells 

The pmsent Invention also relates to recombinant host cells, comprising a nucleic 
acid sequence of the Invention, which are advantageously used in the recombinant 
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production of the polypeptides. A vector comprising a nucleic acid sequence of the present 
invention is introduced into a host cell so that the vector is maintained as a chromosomal 
integrant or as a seif-repiicating extra-chromosomal vector as described earler. The term 
"host eel!" encompasses any progeny of a parent cell that Is not identical to the parent ceil 
5 due to mutations that occur during repiication. The choice of a host ceil will to a large extent 
depend upon the gene encoding the poiypepiide and its source. 

The host ceil may be a unicellular microorganism, &,g„ a prokaryote, or a non- 
unicellular microorganism, e.g. , a eukaryote. 

Useful uniceliuiar ceiis are bacterial cells such as gram positive bacteria including, but 
10 not limited to, a Bacillus ceii, or a Streptomyces cell, or cells of lactic add bacteria; or gram 
negative bacteria such as £. coli and Pseudomonas sp. Lactic acid bacteria include, but are 
not limited to, species of the genera LactOGoocus, LaGtobacfllus, LeuconoBtoo, 
St&ptococcus, Pediococcus, and Ent&rocoGcus. 

The Introduction of a vector into a bacteria! host cell may, for instance, be effected by 
15 protopiast transformation (see, e.g., Chang and Cohen, 1978, Molecular General Genetics 
168: 111-115), using competent cells (see, e.g., Young and Spiztein, 1961, Journal of 
Bacteriology 81: 823-829, or Dubnau and Davidoff-Abeison, 1971, Journal of Iviotecular 
Biology 56: 203-221), electroporation {see, e.g., Shigekawa and Dower, 1888, Biotechniques 
6: 742-751), or conjugation (see, e.g., Koehler and Thome, 1987, Journal of Bacteriology 
20 169:5771-5278). 

The host ceil may be a eukaryote, such as a non-human animal cell, an insect cell, a 
plant cell, or a fungal cell 

in one particular embodiment, the host cell is a fijngal cell. "Fungi" as used herein 
includes tiie pfiyia Ascomycota, Basidiomycota, Chytridiomyoota, and Zygomycota (as 
25 defined by Hawicsworth et al.. In, AInsworth and Bisby's Dictionary of The Fungi, 8th edition, 
1995, CAB international. University Press. Cambridge, UK) as well as the Oomycota (as 
cited in Hawicsworth et al„ 1995, supra, page 171) and a)i mttosporic fungi (Hawicsworth et 
al., 1995, supra), 

in another particular embodiment, the fungal host ceii is a yeast ceii, "Yeasf as used 
30 herein indudes ascosporogenous yeast (Endomycetaies), basidlosporogenous yeast, and 
yeast belonging to the Fungi imperfect! (Blastomycetes). Since the classification of yeast 
may change in the future, for the pui^oses of this invention, yeast shall be defined as 
described in Biology and Activities of Yeast (Slcinner, F.A., Passmore, S.IVl., and Davenport, 
R.R., eds, Soc. App. Bacteiiol. Symposium Series No, 9, 1980). 
35 The yeast host ceil may be a Candida, Hansenuia, Kluyveromyces, Pichia, 

Saocharomyces, SchizosBochammyces, or Yarrowia cell. 

The fungal host cell may be a fiiamentous fungal cetJ. "Filamentous fungi" include al! 
filamentous forms of the subdivision Eumycota and Oomycota (as defined by Hawl<sworth et 
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al., 1995, supra). The filamentous fungi are characterized by a mycaiial wall composed of 
chitin, ceiiulose, glucan, chitosan, mannan, and other complex polysaccharides. Vegetative 
growth is by hypha! elongation and carbon catabolssm is obiigately aerobic, in contrast 
vegetative growth by yeasts such as Saccharomyces cerevisiae is by budding of a uniceiluiar 
5 thalius and carbon catabolism may be fermentative. 

Examples of filamentous fungai host cells are cells of species of, but not limited to, 
Acremonium> Aspetyilius, Fusarium, Humicola, Mucor, Myceiiophthom, Neurospom, 
Penicillium, Thielavia, Tolypodadkm, or Tiichoderma. 

Fungal cells may be transformed by a process involving protoplast formation, 

10 transfonnatfon of tlie protoplasts, and regeneration of the cell wall in a manner known perse. 
Suitable procedures for transformation of Aspergillus host cells are described in EP 238 023 
and Yelton et ai., 1984, Proceedings of the National Academy of Sciences USA 81: 1470- 
1474. Suitable methods for transforming Fusarium species are described by Malardieret a!., 
1989, Gene 78: 147-156 and WO 96/00787. Yeast may be transformed using the procedures 

15 described by Becker and Guarente, in Abelson, J.N. and Simon, M.L, editors, Guide to Yeast 
Genetics and Molecular Bioiogy, Methods in Enzymology, Volume 194, pp 182-187, 
Academic Press, inc., New York; Ito et al., 1983. Journal of Bacteriology 153: 163; and 
Hinnen et al, 1978, Proceedings of tie National Academy of Sciences USA 75; 1920. 

20 Methods of Production 

The present invention also relates to methods for producing a polypeptide of the 
present invention comprising (a) cultivating a strain, v;hich in its wild-type form is capabie of 
producing tlie polypeptide; and (b) recovering the polypeptide. Preferably, the strain is of the 
genus Nooardfopsis, more preferably Nocardfopsis dasscmvUies, Nocafdiopsis alba, 

26 Nocardiopsis prasina or Nocardiopsis antarctica. 

The present invention also relates to methods for producing a poiypeptide of the 
present invention comprising (a) cultivating a host ceil under conditions conducive for 
production of the polypeptide; and (b) recovering Hie polypeptide. 

The present Invention also relates to mettiods for producing a poiypeptide of the 

30 present invention comprising (a) cultivating a host cell under conditions conducive for 
production of the poiypeptide, wherein the host cei! comprises a mutant nucleic acid 
sequence comprising at least one mutation in nucleotides 496-1059, or 1-1059, of SEQ ID 
NO: 1, in which the mutant nucleic acid sequence encodes a polypeptide which (i) consists of 
amino adds 1 to 188, or -166 to 188, of SEQ ID NO: 2, or (ii) is a variant of any of the 

35 sequences of (i), wherein the variant comprises a substitution, deletion, and/or insertion of 
one or more amino acids, or (iii) is an allelic variant of any of the sequences of {i), or (iv) is a 
fragment of any of the sequences of {i). 
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In the production methods of the present invention, the cells are cultivated in a 
nutrient medium suitable for production of the polypeptide using methods knom^n in the art. 
For exampie, the ceii may be cultivated by shake flask cuitivation, smali-scale or large-scale 
fermentation {including continuous, batch, fed-batch, or soiid state fermentations) in 
5 laboratory or industrial fermentors performed in a suitable medium and under conditions 
allowing the polypeptide to be expressed and/or isolated. The cultivation ta!<es place in a 
suitable nutrient medium comprising carbon and nitrogen sources and inorganic salts, using 
procedures known in the art. Suitable media are available from commercial suppliers or may 
be prepared according to published compositions (e.g., in catalogues of the Anwican Type 

10 Culture Collection), If the polypeptide is secreted into the nutrient medium, the polypeptide 
can be recovered directly from the medium. If the polypeptide is not secreted, it can be 
recovered from cell iysates. 

The polypeptides may be detected using methods known in the art that are specific 
for the polypeptides. These detection methods may include use of specific antibodies, 

15 formation of a product, or disappearance of a substrate. For example, a protease assay may 
be used to determine the activity of the polypeptide as desaibed herein. 

The resulting polypeptide may be recovered by methods known in the art. For 
example, the polypeptide may be recovered from the nutrient medium by conventional 
procedures including, but not limited to, centrifugation, filtration, ejdraction, spray-drying, 

20 evaporation, or precipitation. 

The polypeptides of the present invention may be purified by a variety of procedures 
known in the art includirig, but not limited to, diromatography (e.g., ion exchange, affinity, 
hydrophobic, chromatofocusing, and size exclusion), electrophoretic procedures (e.g., 
preparative Isoelectric focusing), differential solubility (e.g., ammonium sulfate precipitation), 

25 SDS-PAGE, or extraction (see, s,g„ Protein Purification, J.-C. Janson and Lars Ryden, 
editors, VCH Publishers, New York, 1989), 

Plants 

The present invention also relates to a transgenic plant, plant part, or plant cell 
30 which has been transformed with a nucleic acid sequence encoding a polypeptide having 
protease activity of the present invention so as to express and produce the polypeptide in 
recoverable quantities. The polypeptide may be recovered from the plant or plant part. 
Alternatively, the plant or plant part containing the recombinant polypeptide may be used as 
such for improving the quality of a food or feed, e.g., improving nutritional value, palatabiltty, 
35 and rheologlcai properties, or to destroy an antlnutritive fat^or. 

In a particular embodiment, the polypeptide is targeted to the endosperm storage 
vacuoles in seeds. This can be obtained by synthesizing it as a precursor with a suitable 
signal peptide, see Horvath et a! in PNAS, Feb. 15, 2000, vol. 97, no. 4, p. 1914-1919. 
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The transgenic plant can be dicoti'ladonous (a dicot) or monocotyledonous (a 
monocot) or engineered variants thereof. Examples of monocot plants are grasses, such as 
meadow grass (blue grass, Poa), forage grass such as Festuca, Loli'um, temperate grass, 
such as Agrostis, and cereafs, e.g., wheat, oats, rye, barley, rice, sorghum, triticaie 
5 (slabfiized hybrid of wheat {Triticam) and rye (Secale), and maize (corn). Examples of dicot 
plants are tobacco, legumes, such as sunflower (Helianthus), cotton (Gossypium), lupins, 
potato, sugar beet, pea, bean and soybean, and cruciferous plants (family Brassicaceae), 
such as cauliflower, rape seed, and the closely related model organism Ambidopsis thaliana. 
Low-phytate plants as descnbed e.g. in US patent no. 5,689,054 and US patent no. 
10 6,11 1,168 are examples of engineered plants. 

Examples of plant parts are stem, calius, leaves, root, fruits, seeds, and tubers, as 
well as the individual tissues comprising these parls, e.g. epidemiis, mesophyll, parenctiyma, 
vascular tissues, meristems. Also specific plant celi compartments, such as chloroplast, 
apoplast, mitochondria, vacuole, peroxisomes, and cytoplasm are considered to be a plant 
1 5 part. Furthermore, any plant cell, whatever the tissue origin, is considered to be a piant part. 
Likewise, plant parts such as specific tissues and cells Isolated to facilitate the utiilsation of 
the invention are aiso considered plant parts, e.g. embryos, endosperms, aleurone and seed 
coats. 

Aiso included within the scope of the present invention are the prc^eny of such 
20 piants, plant parts and plant ceils. 

The transgenic plant or pjant cell expressing a polypeptide of the present invention 
may be constructed in accordance with methods known in the art. Briefly, the piant or piant 
celi is constructed by incorporating one or more expression constructs encoding a 
polypeptide of the present invention Into the piant host genome and propagating the resulting 
25 modified plant or plant celi into a transgenic plant or plant cell. 

Convenientiy, the expression construct is a nucleic acid construct which comprises a 
nucieic acid sequence encoding a pdypeptide of the present Invention operably linked with 
appropriate regulatory sequences required for expression of the nucleic acid sequence in the 
piant or plant part of Qho\Q&. Furthermore, the expression construct may comprise a 
30 selectable marker useful for Identifying host ceils into which the expression construct has 
been integrated and DNA sequences necessary for introduction of the construct into the 
plant in question (the latter depends on the DNA introduction method to be used). 

The choice of regulatory sequences, such as promoter and temilnator sequences 
and optionally signal or transit sequences are determined, for example, on the basis of \A^en, 
35 where, and how the polypeptide is desired to be expressed. For Instance, the esqsression of 
the gene encoding a polypeptide of the present invention may be constitutive or inducible, or 
may be developmental, stage or tissue specific, and the gene product may be targeted to a 
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specific C9!i compartment, tissue or plant part such as seeds or leaves. Regulatory 
sequences are, for exampie, described by Tague et ai., 1988, PSant Piiysioiogy 86; 606. 

For constttutive expression, the following promoters may be used: The 35S-CaMV 
promoter (Franck et al.. 1980, Cel! 21: 285-294), the maize ubiquitin 1 {Cliristensen AH. 
6 Sharrock RA and Quail 1992. Maize poiyubiquttio genes: structure, ttiemiai perturbation of 
expression and transcript splicing, and promoter activity following transfer to protoplasts by 
electroporation), or the rice actin 1 promoter (Plant Mo. Bioi. 18, 675-689,; Zhang W, McElroy 
D. and Wu R 1991, Analysis of rice Act1 S' region activity in transgenic rice plants. Plant Cell 
3, 1155-1 165). Organ-specific promoters may be, for exampie, a promoter from storage sink 

10 tissues such as seeds, potato tubers, and fruits (Edwards & Coruzsi, 1990, Ann, Rev. Genet 
24: 275-303), or from metaboiic sink tissues such as meristems (Ito et al., 1994, Piant Mol, 
Biol, 24; 863-878), a seed specific promoter such as the gluteiin, prolamin, globulin, or 
albumin promoter from rice (Wa et aL, 1998, Plan! and Ceii Physiology 39: 885-889), a Vicia 
faba promoter from the iegumin B4 and the uni(nown seed protein gene from Vicia fafaa 

15 (Conrad et al., 1998, Journal of Plant Physiology 152: 708-711), a promoter from a seed oil 
body protein (Chen et at, 1998. Plant and Cell Physiology 38: 935-941), the storage protein 
napA promoter from Brassica napus, or any other seed spedfic promoter known in the art, 
e.g., as described in WO 91/14772, Furthermore, the promoter may be a leaf specific 
promoter such as the rbcs promoter from rice or tomato (Kyozuka et ai, 1993, Piant 

20 Physioiogy 102: 991-1000, the chlorella virus adenine methyitransferase gene promoter 
{Mttra and Higgins, 1994, Piant Molecular Biology 26: 85-93), or the aldP gene promoter from 
rice (Kagaya et al., 1996, Molecular and General Genetics 248: 668-674), or a wound 
inducible promoter such as the potato pin2 promoter (Xu et al., 1993. Plant Molecular Biology 
22: 573-588). Likewise, the promoter may be inducible by abiotic treatments such as 

25 temperature, drought or alterations in salinity or inducible by exogenously applied 
substances that activate the promoter, e>g. ethanol, oestrogens, plant hormones like 
ethylene, abscisic acid, gibbereliic acid, and/or heavy metals. 

A promoter enhancer element may also be used to achieve higher expression of the 
the protease In the plant For instance, the promoter enhancer element be an intron 

30 which is placed between the promoter and the nucleotide sequence encoding a polypeptide 
of the present invention. For instance, Xu et al., 1993, supra disclose the use of the first 
intron of the rice actin 1 gene to enhance expression. 

Still further, the osdon usage may be optimized for the piant species in question to 
improve expression (see Horvath et ai referred to above). 

35 Th& selectable marker gene and any other parts of the expression construct may be 

chosen from those available in the art. 

The nudeic add consfruct is incorporated into the piant genome according to 
conventional techniques known in the art, including Agrobadenum-Txmdiated transformation, 
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virys-mediated transformation, microinjection, particle bombardment, bioiistic transformation, 
and electroporation (Gasseret ai., 1990, Science 244: 1293; Potrykus, 1990, Bio/Techno!ogy 
8: 535; Shimamoto et al., 1989, Nature 338: 274), 

Presenfiy, Agmbacteiium tumefaciens-med'mted gene transfer is the mettoci of 
5 clioice for generating fransgenic dicots (for a review, see Hooykas and Sciiilperoort, 1992, 
Riant Molecular Biofogy 19: 15-38), and it can also be used for transforming monocots, 
aithougti other transformation methods are more often used for these plants. Presently, the 
method of choice for generating transgenic monocots, suppiementing the Agrobacterium 
approach, is particle bombardment {microscopic goid or tungsten particies coated with the 
10 transforming DNA) of embryonic ca!!i or developing embryos (Christou, 1992, Plant Journai 
2: 275-281; Shimamoto, 1994, Current Opinion Biotechnology 5: 168-162; Vasil et ai., 1992, 
Bio/Technology 10; 667-674). An alternative method for transformation of monocots is based 
on protoplast transtbrmation as described by .Ominilleh et at, 1993. Plant iMolecular Biology 
21:415-428. 

15 Following transformation, the transfonriants having incorporated therein the 

expression construct are selected and regenerated into whole plants according to methods 
weli~l<nown in the art. Often the transformation procedure is designed for the selective 
elimination of selection genes either during regeneration or in the following generations by 
using e.g. co-transformation with two separate T-DNA constructs or site specific excision of 

20 the selection gene by a specific recombinase. 

The present invention also relates to methods for producing a polypeptide of the 
present invention comprising (a) cultivating a transgenic plant or a plant ceii comprising a 
nudeic acid sequence encoding a polypeptide having protease activity of the present 
invention under conditions conducive for production of the polypeptide; and (b) recovering 

25 the polypeptide. 

The present invention also relates to methods for producing a poiypeptide of the 
present invention comprising (a) cultivating a transgenic piant or a piant cell comprising a 
nudeic add sequence encoding a polypeptide having protease activity of the present 
Invention under conditions conducive for production of the polypeptide; and (b) recovering 

30 tfie polypeptide. 



Animais 

The present invention also relates to a transgenic, non-human animal and products 
or elements thereof, examples of which are body fluids such as millt and blood, organs, flesh, 
3S and animal cells. Techniques for expressing proteins, e.g. in mammailan ceils, are known in 
the art, see e.g, the handboo!< Protein Expression: A Practical Approach, Higgins and Names 
(eds), Oxford Univerei^ Press (1999), and the three other handbooks in this series relating to 
Gene Transcription, RNA processing, and Post-translationai Processing. Generally speaiting, 
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to prepare a transgenic animai, selected ceJis of a selected animal are transformed with a 
nudeic acid sequence encoding a polypeptida having protease activity of the present 
invention so as to express and produce the polypeptide. The polypeptide may be recovered 
from the animal, e.g. from the milk of female animals, or the polypeptide may be expressed 
5 to the benefit of the animal itself, e.g. to assist the animal's digestion. Examptes of animais 
are mentioned below in the section headed Animal Feed, 

To produce a transgenic animal with a view to recovering the protease from the rnilk 
of the animal, a gene encoding the protease may be inserted into the fertiiized eggs of an 
ar^imal in question, e.g. by use of a transgene expression vector which comprises a suitabie 

10 mi!k protein promoter, and the gene encodirig the protease. The transgene expression vector 
Is micfolnjected into fertilized eggs, and preferably permanently integrated into the 
chromosome. Once the egg begins to grow and divide, the potentiai embryo is implanted into 
a surrogate mother, and animais carrying the transgene are identified. The resuiting anima! 
can then be muftipiied by conventtonai breeding. The polypeptide may be purified from the 

15 animal's milk, see e.g. Meade, H.y. et al (1999): Expression of recombinant proteins in the 
milk of transgenic animals, Gene expression systems; Using nature for the art of expression, 
J. M. Fernandez and J. P. Hoeffier (eds.), Academic Press. 

in the alternative, in order to produce a transgenic non-human animaf that carries in 
the genome of its somatic and/or germ cells a nudeic add sequence induding a 

20 heterologous transgene construct including a transgene encoding the protease, the 
fe^nsgene may be operabiy linked to a first regulatory sequence for saiivary gland specific 
expression of the protease, as disclosed in WO 2000064247. 

Compositions 

25 In a stiil further aspect, the present invention reiates to compositions comprising a 

poiypeptide of the present invention. 

The poiypeptide compositions may be prepared in accordance wifti methods knov^n 

in the art and may be in the form of a liquid or a dry composition. For Instance, the 

polypeptide composition may be In the form of a granulate or a micrc^ranulate. The 
30 polypepfide to be included in the composition may be stabiiized in accordance with methods 

itnown in the art. 

Exampies are given below of prefen^d uses of the polypeptides or poiypeptide 
compositions of the Invention, 



35 Animal Feed 

The present invention Is also directed to methods for using the poiypeptldes of the 
invention in animal feed, as v^eil as to feed compositions and feed addit}\^s comprising the 
polypeptides of the invention, 
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The term animal indudes all animals, including human beings. Examples of animals 
are non-ruminants, and ruminants. Ruminant animals include, for example, animals such as 
sheep, goats, horses, and cattle, e.g. beef cattle, cows, and young calves. In a particular 
embodiment, the animal is a non-aiminant animal. Non-ruminant animals include mono- 
5 gastric animals, e.g. pigs or swine (Inciuding, but not limited to, piglets, growing pigs, and 
sows); pouiti-y such as turkeys, ducks and chlcl-^en (including but not limited to broiler chicks, 
layers); young calves; and fish (including but not limited to salmon, trout, tilapia, catfish and 
carps; and crustaceans (including but not limited to shrimps and prawns). 

The term feed or feed composition means any compound, preparation, mixture, or 
10 composition suitable for, or intended for intake by an animal. 

in the use according to tiie invention the protease can be fed to the animal before, 
after, or simultaneously with the diet. The latter is prefetred. 

In a particular embodiment, the protease, in the fomn in which it is added to the feed, 
or when being included in a feed additive, is weil-defined. Well-defined means that the 
16 protease preparation is at least 50% pure as determined by Size-exclusion chromatography 
(see Example 12 of WO 01/58275), In other particular embodiments the protease preparation 
is at least 60, 70, 80, 85, 88, 90. 92, 94, or at least 95% pure as determined by this method. 

A weli-defined protease preparation Is advantageous. For instance, it is much easier 
to dose correctly to the feed a protease that is essentially free from interfering or 
20 contaminating other proteases. The term dose correctly refers in parOcutar to the objective of 
obtaining consistent and constant results, and the capability of optimising dosage based 
upon the desired effect 

For the use in animal feed, however, flie protease need not be that pure; it may e.g. 
include other enzymes, in whidi case It could be termed a protease preparation, 
25 The protease preparation can be (a) addod directly to the feed (or used directly in a 

treatment process of proteins), or (b) it can be used in the production of one or mors 
Intemiedlate compositions such as feed additives or premixes that is subsequently added to 
the feed (or used in a treatment process). The degree of purity described above refers to the 
purity of the original protease preparsition, whether used according to (a) or (b) above. 
30 Protease preparations with purities of this order of magnitude are in particular 

obtainable using recombinant methods of production, whereas they are not so easily 
obtained and also subject to a much higher batt^Ko-batch variation when the prx>tease is 
produced by traditional fermentation methods. 

Sucrfi protease preparation may of course be mixed wth other enzymes. 
35 In a particular embodiment, the protease for use according to tiie invention is capable 

of solubilistng proteins. A suitable assay for deterniining solubilised protein is disclosed in 
Example 3. 
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The protein may be an anima! protein, such as meat and bone meal, and/or fish meal; 
or it may be a vegetabie protein. The term vegetabie proteins as used herein refers to any 
compound, composition, preparation or mixture that includes at least one protein derived 
from or originating from a vegetabie, including modified proteins and protein-derivatives. In 
5 particular embodiments, the protein content of the vegetabie proteins is at least 10, 20. 30, 
40, 5Q, or 60% (w/w). 

Vegetable proteins may be derived from vegetable protein sources, such as legumes 
aiid cereals, for example materials from plants of the families Fabaceae iLeguminosae), 
Caicif&raceae, Chempodiaceae, and Poaceae, such as soy bean meai, lupin mea! and 
10 rapeseed meal. 

In a particular embodiment, the vegetable protein source is materia! from one or more 
piants of the family Fabaceae, e.g. soybean, lupine, pea, or bean. 

In another particular embodiment, the vegetable protein source is materia! from one 
or more plants of the family ChenopodiaceaB, e.g. beet, sugar beet, spinach or quinoa. 
15 Other examples of vegetable protein sources are rapeseed, sunflower seed, cotton 

seed, and cabbage. 

Soybean is a preferred vegetable protein source. 

Other examples of vegetabie protein sources are cereals such as barley, wheat, rye, 

oat, maize (corn), rice, trittcale, and sorghum. 
20 The treatment according to the invention of proteins with at least one protease of the 

Invention results in an increased solubiltsation of proteins. 

The following are examples of % soiubilised protein ot)tatnable using the proteases of 

the invention in a monogastric in sntro model: At least 101%, 102%, 103%, 104%, 105% or at 

least 106% relative to a blank. The percentage of solubilised protein Is determined using the 
25 monogastric in vitm model of Example 3. Hie term solubHlsation of proteins basically means 

bringing prote!n{s} into solution. Such solubillsation may be due to protease-mediated 

release of protein from other components of the usually complex natural composi^ons such 

as feed. 

In a further particular embodiment, the protease for use according to the invention Is 
30 capable of Increasing the amount of digestible proteins. The following are examples of % 

digested or digestible protein obtainable using the proteases of the Invention In a 

monogastric in vitro model: At least 102%, 103%, 104%, 105%, 106%, 107%, 108%, 109%. 

or at least 110%, relative to a blank. The percentage of digested or digestible protein is 

determined using the in vitro model of Example 3, 
35 The following are examples of % digested or digestible protein obtainable using the 

proteases of the invention in an aquaculture in vitm model: At least 104%, 105%, 106%. 

107%, 108%, 109% or at least 110%, relative to a blank. 
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In a stil! further particular embodiment, the protease for use according to the invention 
is capable of increasing the Degree of Hydrolysis (DH) of proteins. Jh& foiiowing are 
examples of Degree of Hydroiysis increase obtainable in a mDnogastric in vitro mode!; At 
least 100%, 101%, 102%. 103%, 104%. 105%, 106%, 107%, 108%, 109%, 110% or at least 
5 111%, relative to a blank. The DH is determined using the monogastrlc in vitro mode! of 
Example 3. 

The following are examples of Degree of Hydroiysis increase obtainabSe in an 
aquaculture in vitro model: At least 102%, 103%, 104%, 106%, 106%, 107%, 108%, 109%. 
110%. 111%, 112%, orat least 113%, relative toablank. 

10 In a particular embodiment of a (pre-) treatment process of the invention, the 

protease{s) in question is affecting (or acting on, or exerting its solubilising influence on) the 
proteins or protein sources. To achieve this, the protein or protein source is typically 
suspended in a solvent, e.g. an aqueous solvent such as water, and the pH and temperature 
vaiues are adjusted paying due regard to the characteristics of the enzyme in question. For 

15 example, the treatment may take place at a pH-value at which the activity of Itie actuai 
protease is at least at least 40%, 50%, 60%. 70%, 80% or at least 90%. Likewise, for 
example, the treatment may take place at a temperature at which the activity of the actual 
protease is at least 40%, 50%, 60%, 70%, 80% or at least 90%, The above percentage 
activity indications are relative to the maximum activities. The enzymatic reaction is 

20 continued until the desired result is achieved, foiiowing which ft may or may not be stopped 
by inactivating the enzyme, e.g. by a heat-treatment step, 

(n another particular embodiment of a treatment pnscess of the invention, the 
protease action is sustained, meaning e.g. that the protease is added to the proteins or 
protein sources, but its solubilising influence is so to speak not switched on until later when 

25 desired, once suitable solubilising conditions are established, or once any enzyme inhibitors 
are inactivated, or whatever other means could have been applied to postpone the action of 
the enzyme. 

in one embodim^t the treatment is a pne-6'eatment of animal feed or proteins for use 
m animai feed, 

30 The term Improving the nutritional value of an animal feed means Improving itie 

availability and/or digestibility of the proteins, thereby leading to Increased protein extraction 
from the diet components, higher protein yields, increased protein degradation and/or im- 
proved protein utilisation. The nutritional value of the feed is therefore increased, and the 
animal performance such as grovs/th rate and/or weight gain and/or feed conversion ratio (i.e. 

36 the weight of ingested feed relative to weight gain) of the animal is/are improved, 

in a particular embodiment the feed conversion mtio is Increased by at least 1%, 2%, 
3%, 4%, 5%, 6%, 7%, 8%, 9% or at least 10%. in a further particular embodiment the weight 
gain is Increased by at least 2%, 3%. 4%, 5%. 6%. 7%. 8%, 9%, 10% or at least 11%. 
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These figures are reSative to controi experiments with no protease addition. 

The feed conversion ratio (FCR) and the weigi-st gain may be caiculated as described 
in EEC (1986); Directive de !a Commission du 9 avril 1986 fixant la methode de caScu! de la 
valeur 6n6rget!que des aliments composes destines a la vojailie. Joumai Officiel des 
5 Communautes Europeennes, L1 30, 53 - 54. 

The protease can be added to the feed In any form, be It as a relatively pure 
protease, or in admixture with other componente intended for addition to animal feed, i.e. in 
the form of animal feed additives, such as the so-ca!ied pre-mixes for animal feed, 

jn a further aspect the present invention relates to compositions for use in animal 
10 feed, such as animal feed, and animal feed additives, e.g. premixes. 

Apart from the protease of the invention, the animal feed additives of the invention 
contain at least one fat-soluble vitamin, and/or at least one v^ater soluble vitamin, and/or at 
least one trace mineral The feed additive may also contain at least one macro mineral. 

Further, optional, feed-additive ingredients are colouring agents, e.g. canatenolds 
16 such as beta-carotene, astaxantliin, and lutein; aroma compounds; stabilisers; antimicrobial 
peptides; polyunsaturated fatty acids; reactive oxygen generating species; and/or at (east 
one other enziflne selected from amongst (phytase (EC 3.1.3.8 or 3.1,3.26); xylanase (EC 
3,2.18); galactanase (EC 3.2.1.89); alpha-galactosidase {EC 3.2.1.22); protease (EC 3.4. , ). 
phosphoiipase A1 {EC 3.1.1.32); phospholipase A2 {EC 3.1.1,4); lysophospholipase (EC 
20 3.1.15); phospholipase C (3.14.3); phospholipase D (EC 3,14.4); amylase such as, for 
example, aipha-amylase {EC 3.2.11); and/or beta-glucanase (EC 3.2,1.4 or EC 3.2.16). 

In a particular embodiment these other enzymes are well-defined (as defined above 
for protease preparations). 

Examples of antimicrobia! peptides (AMP's) are CAP18, Leucodn A, Tritrpticin, Pro- 
25 tegrin-l Thanatin, Defensin, Lactofenin, Lactofemcin, and Ovlspirin such as Novispinn 
(Robert Lehrer, 2000), Plectasins, and Statins, including the compounds and pol^eptides 
disclosed in WO 03/044049 and WO 03/048148, as well as variants or fragments of the 
above that retain antimicrobial actiwty. 

Examples of antifungal polypeptides (AFP's) are the Aspergillus giganteus, and 
30 Aspet^illus niger peptides, as well as variants and fragments thereof which retain antifungal 
activity, as disclosed in WO 94/01459 and WO 02/090384. 

E)Qmpies of polyunsaturated fatty adds are CI 8, C20 and C22 polyunsaturated fatty 
acids, such as arachidonic acid, docosohexaenoic acid, eicosapentaenoic add and ganima- 
linoleic acid. 

35 Examples of reactive oxygen generating species are chemicals such as pettorate, 

persulphate, or percarbonate; and enzymes such as an o)ddase, an oxygenase or a 
syntethasaUsally fat- and water-soluble vitamins, as well as trace minerals form part of a so- 
called premix intended for addition to the feed, whereas macro minerals are usually 
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separately added to the feed, A premix enriched with a protease of tiie invention, is an 
example of an animal feed additive of the invention. 

In a particular en^bodiment, the animal feed additive of the invention is intended for 
being included (or prescribed as fiaving to be included) in animal diets or feed at levels of 
5 0.01 to 10.0%; more particularly 0.05 to 5,0%; or 0.2 to 1.0% (% meaning g additive per 100 
g feed). This is so in particuiar for premixes. 

1Tie foilowing are non-exclusive lists of examples of these components; 

Examples of fat-soiubie vitamins are vitamin A, vitamin D3, vitamin E, and vitamin K, 
e.g. vitamin K3, 

10 Examples of water-soluble vitamins are vitamin B12, biotin and choline, vitamin B1, 

vitamin B2, vitamin B6, niacin, folic acid and panthothenate, e.g. Ca-D-panthothenate. 

Examples of trace minerals are manganese, zinc, iron, copper, iodine, selenium, and 

cobalt 

Examples of macro minerals are calcium, phosphorus and sodium. 

15 Itse nutritionai requirements of these components (exemplified Vi/lth poultry and 

piglets/pigs) are listed in Table A of WO 01/68275. Nutritional requirement means that these 
components should be provided in the diet in the concentrations indicated. 

In the aiternative, the animal feed additive of the invention comprises at least one of 
the individual components specified in Table A of WO 01/58275. At least one means either 

20 of, one or more of, one, or two, or three, or four and so forth up to afl thirteen, or up to ali 
fifteen individual components. More specificaiiy, this at ieast one individual component is 
included in the additive of the invention in such an amount as to provide an in-feed- 
concentration within the range Indicated in column four, or coiumn five, or column six of 
Table A, 

25 The present invention also relates to animal feed compositions. Animal feed 

compositions or diets have a relatively high content of protein. Pouitry and pig diets can be 
characterised as indicated in Table B of WO 01/58275, columns 2-3. Rsh diets can be 
characterised as indicated in column 4 of this Table B. Furthermore such fish diets usually 
have a crude fat content of 200-310 g/!<Q. WO 01/68275 corresponds to 09/779334 which Is 

30 hereby incorporated by reference. 

An animal feed composition according to the invention has a crude protein content of 
50-800 gkg, and furthemiore comprises at leas^ one protease as ciaimed herein. 

FurUiermore, or in the alternative (to the crude protein content indicated above), the 
animai feed composition of the invention has a content of metabolisabie energy of 10-30 

35 MMq; and/or a content of calcium of 0.1-200 g/kg; and/or a content of available phosphorus 
erf 0.1-200 gikg; and/or a content of methionine of 0,1-100 g/kg; and/or a content of 
methionine plus cysteine of 0, 1 -1 50 g/kg; and/or a content of lysine of 0.5-50 g/kg. 
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In particular embodiments, the content of metabofisaWe energy, crude protein, 

calcium, phosphorus, methionine, methionine plus cysteine, and/or lysine is within any one of 

ranges 2, 3, 4 or 5 in Table B of WO 01/58275 (R. 2-5), 

Crude protein is calculated as nitrogen (N) multiplied by a factor 6,25, !,e. Crude 
5 protein (g/'kg)- N (g/kg) x 6,25. The nitrogen content is determined by the l^eldaiii method 

(A,0,A,C,, 1984, Official Methods of Analysis 14ih ed.. Association of Officia! Anaiytical 

Chemists, Washington DC). 

Metaboltsafaie energy can be caiculated on the basis of the NRC pubifcation Nutrient 

requirements in swine, ninth revised edition 1988, subcommittee on swine nutrition, 
10 committee on animai nutrition, board of agricuiture, national research councti. National 

Academy Press, Washington, D.C., pp. 2-6, and the European Table of Energy Values for 

Poultry Feed-stuffs, Spelderhoit centre for poultry research and extension, 7361 DA 

Beekbergen, The Netherlands. Grafisch bedrijf Ponsen & iooijen bv, Wageningen. ISSN 90- 

71463-12-5. 

15 The dietary content of calcium, avaiiabie phosphorus and amino acids in complete 

animal diets is calculated on the basis of feed tables such as Veevoedertabei 1997, 
gegevens over chemische samenstelling, verteerbaarheld en voederwaarde van 
voedermiddelen. Central Veevoederbureau, Rundenweg 6, 8219 pk Leiystad. ISSN 90- 
72839-13-7. 

20 in a particular embodiment, the animal feed composition of the invention contains at 

least one protein or protein source as defined above. It may also contain animal protein, such 
as Meat and Bone Meal, and/or Fish Meal, typically tn an amount of 0-25%. 

In stiii further particular embodiments, the animal feed composition of the invention 
contains 0-80% maize; and/or 0-80% sorghum; and/or 0-70% wheat; and/or 0-70% Barley; 

26 and/or 0-30% oats; and/or 0-40% soybean meai; and/or 0-25% fish meal; and/or 0-25% meat 
and bone meai; and/or 0-20% whey. 

Animal diets can e.g. be manufactured as mash feed (non pelleted) or pelleted feed. 
Typically, the milled feed-stuffs are mixed and sufficient amounts of essential vitamins and 
minerals am added according to the spedficatlons for the species in question. Enzymes can 

30 be added as solid or liquid enzyme forniulations. For example, a solid enzyme formulation is 
typically added before or during the mixing step; and a liquid enzyme preparation Is typically 
added after ti^e pelleting step. The enzyme may also be incorporated in a feed additive or 
premix. 

The final enzyme concentration in the diet is within the range of 0.01-200 mg enzyme 
35 protein per kg diet, for example In the range of 0.5-25 mg enzyme protein per kg animal diet. 

The protease should of course be applied In an effective amount, i.e. in an amount 
adequate for impmving solubilisation and/or improving nutritional value of feed. It is at 
present contemplated that the enzyme is administered in on© or more of the following 
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amounts (dosage ranges): 0.01-200; 0,01-100; 0.5-100; 1-50; 5-100; 10-100; 0.05-50; or 
0.10-10 - all these ranges being in mg protease enzyme protein per kg feed (ppm). 

For determining mg enzyme protein per kg feed, the protease is purified from the feed 
composition, and tlie speclfsc activity of the purified protease is determined using a relevant 
5 assay (see under protease activity, substrates, and assays). The protease activity of the feed 
composition as such is aiso determined using tUe same assay, and on the basis of these two 
determinations, the dosage in mg enzyme protein per kg feed is caScuiated. 

The same prindpfes apply for determining mg enzyme protein in feed additives. Of 
course, if a sampie is available of the protease used for preparing the feed additive or the 
10 feed, the spedflc activity is detemnined from this sample (no need to purify the protease from 
the feed composition or the additive). 

The present Invention is further described by the following examples which should not 
be construed as limiting the scope of the Invention. 



1 5 Detergent Compositions 

The protease of the invention may bs added to and thus become a component of a 
detergent composition. 

The detergent composition of the invention may for example be formulated as a hand 
or machine laundry detergent composition including a laundi^ additive composition suitable 
20 for pre-lreatment of stained fabrics and a rinse added fabric softener composition, or be 
formulated as a detergent composition for use In general household hard surface cleaning 
operations, or be formulated for hand or machine dishwashing operations. 

In a specific aspect, the invention provides a detergent additive comprising the 
protease of the invention. The detergent additive as well as the detergent composition may 
25 comprise one or more other enzymes such as another protease, such as alkaline proteases 
from Bacillus, a ilpase, a cutinase, an amj^ase, a carbohydrase, a cellulase, a pectinase, a 
mannanase, an arablnase, a galactanase, a j^lanase, an o)dda$e, e.g., a laccase, and/or a 
peroxidase. 

in general the properties of the chosen enzyme(s) should be compatible with the 
30 selected detergent, {i.e. pH-optlmum, compatibility wth oiiier enzymaSc and non-enzymatic 
ingredients, etc.), and the enzyme(s) should be present In effective amounts. 

Suitable lipases include those of bacterial or fungal origin. Chemically modified or 
protein engineered mutants are Included. Examples of useful lipases include lipases from 
HiimiCQla (synonym Thermomyces), e.g. from H, lanuginosa {T. Imuginosus) as described in 
35 EP 258068 and EP 305216 or from H. insohns as described in WO 96/13580, a 
Ps&udomonas iipase, e.g. from P. afoaiigmes or P. pseudoaloaiigenes (EP 218272), P. 
cepacia {EP 331376), P. sMzeri ((SB 1,372,034), P. fiuoresoBns, Pseudomonas sp. strain 
SD 705 (WO 95/06720 and WO 96/27002), P. wiscomln&nm (WO 96/12012), a BaciUus 
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Ipase, e.g. from B. subtilis {Dartots et af. (1993), Biochemica et Biophysica Acta, 1131, 253- 
360), e, stearothermophilus (JP 64/744992) or B. pumilus (WO 91/16422). Other exampies 
are iipase variants such as those described in WO 92/05249. WO 94/01541, BP 407225, EP 
260105, WO 95/35381, WO 96/00292, WO 95/30744, WO 94/25578, WO 95/14783, WO 
5 95/22615, WO 97/04079 and WO 97/07202. Preferred comniercially available iipase 
enzymes indude Lipolase^and Upolase Ultra™ (Novozymes A/S). 

Suitable amylases (alpha- and/or beta-) inciude those of bacterial or funga! origin, 
Chemicafly modified or protein engineered mutants are included. Amylases inciude, for 
example, alpha-amylases obtained from Bacillus, e.g. a special strain of S. iicheniformis, 

10 described in more detai! in GB 1,298,839, Examples of useful amylases are the variants 
described in WO 94/02597, WO 94/18314, WO 96/23873, and WO 97/43424, especialiy the 
variants vt'ith substitutions in one or more of the folfow/ing positions: 15, 23, 105, 106, 124, 
128, 133, 154, 156, 181, 188, 190, 197, 202, 208, 209, 243, 264, 304, 305, 391, 408, and 
444. Commerciaily available amylases are DuramyP"', TermamyP', Fungamyr'^Vand BAN™ 

1 5 (Novozymes A/S), Rapidase^^"'' and Purastar™ (from Genencor fntennationa! inc.). 

Suitable cellulases include those of bacterial or funga! origin, Chemicaliy modified or 
protein engineered mutants are included. Suitable cellulases include celluiases from the 
genera Bacillus, Pseudomonas, Humioola, Fasariurn, Thielavia, Acremoniurn, e.g. Ihe funga! 
cellulases produced from Humicola insolens, Mycellophthora thermophila and Fusarium 

20 oxysporum disclosed in US 4,435.307, US 5,648,263, US 5,691 ,178, US 5,776,757 and WO 
89/09259. Especially suitable cellulases are the alkaline or neutral ceiiuiases having colour 
care benefits. Examples of such cellulases are cellulases described in EP 0 495257, EP 
531372, WO 96/11262, WO 86/29397, WO 98/08940. Other examples are ceilulase variants 
such as those described in WO 94/07998, EP 0 531 315, US 5,457,046, US 5,686,593, US 

25 5,763,254, WO 95/24471, WO 98/12307 and WO 99/01544, Commercially available 
cellulases Include Celluzyme™, and Carezyme™ {Novozymes A/S), Glazinase™, and 
PuradaxHA^^'* (Genencor Internationa! Inc.), arid KAC~500(8)™{Kao Corporation). 

Suitable peroxidases/oxidases include those of plant, bacterial or fungai origin. 
Chemicaiiy modified or protein engineered mutants are included. Examples of useful 

30 peroxidases include peroxidases from Coprinus, e.g. from C. cinefeus, and variants thereof 
as those descaibed in WO 93/24618, WO 95/10602, and WO 98/15257, Commerciaily 
availabie peroxidases include Guardzyme™ (Novozymes). 

The detergent en2yme(s) may be included in a detergent composition by adding 
separate additives containing one or more enzymes, or by adding a combined additive 

35 comprising ai! of these enzymes. A detergent additive of the invention, i.e, a separate 
additive or a combined additive, can be formulated e.g, as a granulate, a liquid, a slurry, etc. 
Preferred detergent additive fom^ulations are granulates, in parficular non-dusting 
granulates, liquids, in particular stabiiized iiquids, or slurries. 
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Non-dusting granulates may be producsd, e.g., as disciosed in US 4,106,991 and 
4,661,452 and may optionally be coated by methods known in the art. Examples of waxy 
coating materials are poly(ethy!ene oxide) products (polyethylenegiycoi, PEG) with mean 
moiar weights of 1000 to 20000; 6!hox>1ated nonylpiienols having from 16 to 50 ethyiene 
5 oxide units; ethoxyiated fatty alcohols in wlilch the alcohol contains from 12 to 20 carbon 
atoms and in which there are 15 to 80 ethylene oxide units; fatty aicohoSs; fatty adds; and 
mono- and di- and triglycerides of fatty acids. Examples of film-forming coating materials 
suitable for application by fluid bed techniques are given in GB 1483591. Liquid enzyme 
preparations may, for instance, be stabilized by adding a polyol such as propylene glycol, a 
10 sugar or sugar alcohoi, lactic acid or boric acid according to established methods. Protected 
enzymes may be prepared according to the method disciosed in EP 238216. 

The detergent composition of the invention may be in any convenient form, e.g., a 
bar, a tablet, a powder, a granule, a paste or a liquid. A liquid detergent may be aqueous, 
typically containing up to 70 % water and 0-30 % organic solvent, or non-aqueous. 
15 The detergent composition comprises one or more surfactants, which may be non- 

ionic including semi-polar and/or anionic and/or catfonic and/or ssvittertonic. The surfactants 
are typically present at a level of from 0.1% to 60% by weight. 

When included therein the detergent wtf! usually contain from about 1% to about 40% 
of an anionic surfactant such as iinear aikyfbenzenesuifonate, alpha-olefinsulfonate, alkyf 
20 sulfate {fatty alcohol sulfate), alcohoi ethoxysuifate, secondary aliianesulfonate, aipha-sulfo 
fatty acid methyl ester, alky!- or aikenylsuccinic acid or soap. 

When included therein the detergent wili usually contain from about 0.2% to about 
40% of a non-ionic surfactant such as alcohol ethoxyiate, nonyiphenol ethoxyiate, 
aii^ylpolygiycoside, alkyldimethylamineoxlde, ethoxyiated fatty acid monoethanolamide, fatly 
25 acid monoeftianolamide, poiyhydroxy alkyi fatty acid amide, or N-acyl N-alkyl derivatives of 
glucosamine {"glucamides"). 

The detergent may contain 0-65 % of a detergent builder or complexing agent such 
as zeolite, diphosphate, triphosphate, phosphonate, carbonate, catrate, nitrilotriacetic add, 
ethyienediaminetetraacettc acid, diethylenetriaminspentaaceWc add, alkyf- or alkenylsucclnlc 
30 acid, soluble silicates or layered silicates {e.g. SKS-6 from Hoechst), 

Hie determent may comprise one or more polymers. Examples are 
cartaoxymethylceilulose, poly(viny!pyrrolidone), poly {ethylene glycol), poly(vfny! alcohoi), 
poly(vlfiylpyrtdine-N-oxide), poly<^«nylimidazoie), polycarboxyiates such as polyacryiates, 
maleic/acri^icadd copolymers and lauryi methacryiate/acryilc acid copolymers. 
35 The detergent may contain a bleaching system which may comprise a HzOa source 

such as perborate or percarbonate which may be combined with a peracid-forming bleach 
activator such as tetraacetyiethyienediamine or nonanoyloxybenzenesulfonate. Alternatively, 
the bleaching system may comprise peroxyacids of e.g. the amide, imide, or suifone type. 
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The enzyme(s) of the detergent composition of the invention may be stabilized using 
conventional stabilizing agents, e.g„ a polyol such as propylene glyco! or giyceroi, a sugar or 
sugar aicohoi, iactic acid, boric acid, or a boric acid derivative, e.g., an aromatic borate ester, 
or a pheny! boronic acid derivative such as 4-formyipheny! boronic acid, and the composition 
5 may be formuiated as described in e.g. WO 92/1 9709 and WO 92/1 9708. 

The detergent may aiso contain other conventlonai detergent ingredients such as e.g. 
fabric conditioners including ciays, foam boosters, suds suppressors, arrli-corrosion agents, 
soli-suspending agents, anti-soil redeposition agents, dyes, bactericides, optical brighteners, 
hydrotmopes, tarnish inhibitors, or perfumes. 
10 It is at present contemplated that in the detergent compositions any enzyme, in 

particular the enzyme of the invention, may be added in an amount corresponding to 0.01- 
100 mg of enzyme protein per itter of wash liqour, preferably 0,05-5 mg of enzyme protein 
per liter of wash liqour, in particular 0.1-1 mg of enzyme protein per iiter of v/ash llqour. 

The enzyme of the invention may addltionaliy be incorporated in the detergent 
15 fomiulations disclosed in WO 97/07202. 



Deposit of Btotogicai Material 

The following biological material has been deposited under the terms of the Budapest 
Treaty with the Deutsche Sammiung von Mikroorgantsrnen und Zetifcuituren GmbH, 

20 Mast^eroderWeg lb, D-38124 Braunschweig, and given the foilowing accessiori number; 
Deposit Accession Number Date of Deposit 

Nocardiop^s pradna DSM 15648 May 3D, 2003 

The strain has been deposited under conditions that assure that access to the culture 
will be avaiiable during the pendency of this patent application to one determined by the 

25 Commissioner of Patents and Trademarks to be entifled thereto under 37 C.F.R. §1,14 and 
35 U.S.C, §122. The deposit represents a substantlaliy pure culture of the deposited strain. 
The deposit is avaiiable as required by foreign patent laws In countries wherein counterparts 
of Uie subject application or its progeny are filed. However, it should be understood that the 
avaiiabtiity of a deposit does not constitute a license to practice the subject invention in 

30 derogation of patent rights granted by governmental action. 

The above strain was isolated In 2001 from a soil sample from Denmark. 
The invention described and claimed herein is not to be limited In scope by the 
specific embodimente herein disclosed, since these embodiments are Intended as 
Illustrations of several aspects of the invention. Any equivalent embodiments are intended to 

35 be wthin the scope of tils invention, indeed, various modifications of the invention in addition 
to those shown and described herein wili become apparent to those skilled in &ie art from the 
foregoing desaiption. Such modifications are also intended to feit within fh& scope of the 
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appended claims, in the case of conflict, tfie present disciosufe including definitions will 
control. 

Various references are dted herein, the disclosures of which are incorporated by 
reference in their entireties. 

6 

EXAMPLES 

EXAMPLE 1; Cloning and expressjon of the protease from Nocardfopsis prasina DSM 

15648 

Reagents and media 

10 IB agar Described in Ausubel, F. M. et al. (eds.) "Current protocols in Molecular 

Biology". John Wiley and Sons, 1985 
LB-PGagar LB agar suppiemented with 0.5% Glucose and 0,05 M potassium 
phosphate, pH 7.0 

PS-1 10% sucrose. 4% soybean flour, 1% NasPO^-iaHzO, 0,5% CaCOg, and 

15 0.01% pluronic acid 

TE lOmMTris-HCI, pH7.4 

1 mU EDTA. pH 8.0 
TEL SO mg/ml Lysozym In TB-buffer 

Thfocyanate 5y guanidium thiocyanate 
20 100 mM EDTA 

0.6 % w/v N-laurylsarcosine, sodium salt 

60 g thiocyanate, 20 ml 0.5 M EDTA, pH 8,0, 20 mi H^O dissolves at 65X. 

Coo! down to room temperature {RT} and add 0,6 g N-iaury!sarcosine. Add 
HjO to 100 mi and filter it through a 0.2 [j sterile filter. 

25 NNAc 7.5MCKCOONH^ 

4 3 4 

TER 1 \}Qim\ Rnase A in TE-buffer 

CIA Chlorofornrs/isoamyi alcohol 24:1 

Cioninq of SEQ iP NO: 1 , 
30 SEQ ID NO; 1 Is the DNA sequence encoding a pnoform of the protease from 

Nocardiopsis prasina DBM 15648. Nucleotides 496 -1059 corresponds to the mature peptide 
encoding part. SEQ ID NO; 2 is the deduced amino add sequence of SEQ ID NO; 1 . Amino 
acids -165 to -1 is the propeptide, and amino acids 1 to 188 the mature peptide. 

The wild type was grown for 3 days before harvest in the following medium at SCC; 

35 Trypticase 20 g 

Yeast extract 5 g 

Ferrochloride 6 mg 

Magnesiumsuifate 15mg 

Distilled water ad 1000 ml 
40 pH adjusted to 9 by addition of sodium carbonate 

Genomic DNA from Nocardiopsis prasina DSM 15648 was isolated according to 

the following procedure: 

1 . Han/est 1.5 m! culture and resuspend in 100 [i\ TEL. Incubate at 37"C for 30 mln, 

2. Add 500 pi thiocynate buffer and leave at room temperafcre for 10 min. 
45 3. Add 250 p! NHjfAc and leave at ice for 1 0 min. 

4, Add 500 p! C!A and mix, 

5. Transfer to a miGrocentrifuge and spin for 10 min, at full speed. 

39 



wo 2*H>4/1 1 1 222 P€T/BK2&ft4/(>00434 

6. Transfer supernatant to a new Eppendorf tube and add 0,64 voiume cold isopropanol. Mix 
thoroughly. 

7. Spin and wash the DNA pellet with 70 % EtOH, 

8. Resuspend the genomic DNA in 100 pi TER. 

5 The genomic DNA was used as template for PGR amplification using below 

primers SEQ ID NOS. 3 and 4, The PGR fragment was isolated on a 0.7% agarose gei. 
Primer 1 346; 5 - GTT CAT CGA TCG CAT CGG CTG CCA CCG GAC CGC TCC CGC AGT 
C -3' (SEQ iD NO; 3) 

Primer 1602: 6'- GCG GAT COT ATT AGS TCC GGA GAC GGA CGC CCC AGG AG-3" 
10 (SEQ ID NO: 4) 

The digested and purified PGR fragment was ligated to the Cia I and BamH I 
digested piasmid pDG268NeoMCS-PramyQ/Prcrylfi/cryUfAstab/Sav (United States Patent; 
5.955,310). 

The ligation mixture was used for tmnsformation into E. coH TOP10F' (Snvftrogen 
15 BV, The Netherlands) and several colonies were selected for miniprep (QIAprep spin, 
QfAGEN GmbH, Germany), The purified plasmids were checked for insert before 
transformation into a_strain of Bacillus suhtilis derived from B. subtilis DN 1885 with disrupted 
apr, npr and pel genes (Diderichsen et a( (1990), J. Bacteriol., 172, 4315-4321). The 
disruption was performed essentially as described in "Bacillus subtilis and other Gram- 
20 Positive Bacteria," American Society for IVJicrobioiogy, p.618, eds. A.L. Sonenshein, J.A. 
Hoch and Ridnard Losick (1S93). Transformed ceils were plated on 1% si<im ml!!< LB-PG 
agar plates, supplemented with 6 jc/g/ml chloramphenicol. The plated ceils were incubated 
over night at Z7°C and protease containing colonies were identified by a surrounding deartng 
zone. Protease positive colonies were selected and the coding sequence of the expressed 
25 enzyme from the expression construct was confinned by DNA sequence analysis. 

Fermentation 

The B&ciilus subtilis host cell transfonned as described above was fennented on a 
rotary shai<ing table (260 r.p.m.) in 500 ml baffled Erlenmeyer flasks containing 100 ml PS-1 
30 medium supplemented with 6 //g/ml chtoramphenicoi, at 37'C for 16 hours and at ae'C for 
extra 4 days. 

EXAMPLE 2: Purification and characterization of the protease from Nocardiofisis 
pmsina DSM 15648 

35 

Protease assays 
1)pNA assay: 

pNA substrate : Suc-AAPF-pNA (Bachem L-1400). 
Temperature ; Room temperature (25°C) 
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Assay buffers :100my succinic acid, lOOmM HEPES, lOOmM CHES, lOOmM CABS, 
ImM CaCla. IdOmM KCI, 0.01% Triton X-100 adjusted to pH-values 2.0, 2.5. 3.0> 3.5, 4.0, 
5.0, 8.0, 7.0, 6.0, 9.0, 10.0, 11.0, and 12.0 with HCl or NaOH, 

5 20,ul protease (diluted in 0.01% Triton X-100) is mixed with 100i.il assay buffer, the 

assay is started by adding lOOjit! pNA substmte {60mg dissolved in 1.0ml DMSO and further 
diluted 45x with 0.01 % Triton X-100). The increase in OD^as is monitored as a measure of the 
protease activity, 

10 2) Protazyme AK assay: 

Substrate : Protazyme AK tablet (cross-linked and dyed casein; from Megazyme) 
Temperature : controlled (assay temperature). 

Assay buffers ilOOmM succinic add, lOOmM HEPES, lOOmM CHES. lOOmM CABS, 
1mM CaCI?, ISOmM KCI, 0.01% Triton X-100 adjusted to pH-vaiues 2.0, 2.6, 3.0, 3.5, 4.0, 
1 3 5.0, 6.0, 7.0, 8.0, 9.0, 1 0.0 and 1 1 .0 with HCl or NaOH. 

A Protazyme AK tablet is suspended in 2.0ml 0.01% Triton X-100 by gentle stirring, 
SOO^il of this suspension and SOOfii assay buffer are mixed in an Eppendorf tube and placed 
on ice. 20M.i protease sample {diluted in 0.01% Triton X-IOO) is added. The assay is initiated 

20 by transf^ring the Eppendorf tube to an Eppendorf thermomixer, which is set to the assay 
temperature. The tube is incubated for IS minutes on the Eppendorf thennomixer at its 
highest shaking rate (1400 rpm). Hie incubation is stopped by transferring the tube back to 
the ice bath. Then the tube is centnfuged in an icecoid centrifuge for a few minutes and 200fjl 
supernatant Is transferred to a micratiter plate, ODeso is read as a measure of protease 

25 activity. A buffer blind is included in the assay (instead of enzyme). 

Purifioation 

Culture broth from the fermentation described in Example 1 was centrifuged {20000 x 
g, 20 min) and the supematants were carefully decanted from the precipitates. The combined 

30 supernatants were filtered through a Seitz EKS plate in order to remove the rest of the Badi- 
lus host cells. The EKS fStrate was applied to a badtracin silica coiumn equilibrated in 
lOOmM H3BO3, 10mM succinic add, 2mM CaCla, pH 7. Aner washing the column extensively 
with the equilibration buffer, the protease was step-eiuted with lOOmiVl H38O3, lOmM suc- 
cinic acid, 2miV! CaCis, 1M NaCl, 25% (soprc^noi. pH 7. The bacitracin eiuate was trans- 

35 ferred to SOmM H38O3, iOmM CH3COOH, ImM CaCla, pH 4.S on a G25 sephadex coiumn 
and applied to a S sepharose HP coiumn equilibrated in the same buffer. After washing the 
column extensively with the equilibration buffer, the protease was eiuted with a linear NaCl 
gradient {0 -> Q.6M) in the same buffer. Fractions from the coiumn were analysed for prote- 
ase activity (using the Protazyme AK assay at ST'C and pH 9) and active fractions were fur- 

40 ther analysed by SDS-PAGE. Fractions, where oniy one band was seen on the coomassie 
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Stained SDS~PAGE gel, were pooisd as the purified preparation and was used for further 
characterization. 



pH-aclivttv . pH-stabilitv. and temperature-activity 
5 The pNA assay was used for obtaining the pH-activily profile as well as the pH- 

stabiliiy profile. For the pH-stabitity profile ti*ie protease was diluted 10x in tiie assay buffers 
and incubated for 2 hours at 37"C. After incubation the protease samples were transferred to 
the same pH - pH 9, before assay for residuai activity, by diiution in the pH 9 assay buffer. 
The Protazyme AK assay was used for obtaining the temperature-activity profile at pH 0. The 
10 results are shown in Tables 1-3 beiow. 



lie 1: pH-activity profiie 





Protease derived from Nocar- 


Protase derived from Nocar- 


PH 


diopsis prssina DSM 15648 


dfopsis sp. NRRL 18262 


2 


0.00 




3 


0.01 


0.00 


j4 


0.02 


0.02 


5 


0.07 


0.07 


6 


0.25 


0.21 


7 


0.37 


0.44 


8 


0,52 


0.67 


9 


0.65 


0,88 


10 


0.76 


100 


11 


1.00 


0.93 
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Table 2: pH-stabiilty profile 





Protease derived from Nocar- 
diofisis prasina DSM 1 SS4S 


Protase derived from No- 
cardtopsis sp, NRRL 18262 


2.0 


0,86 


0,78 


2,6 


0,95 


1.00 


3.0 


0,96 


1.03 


3.5 


0,95 


0.98 


4>0 


0.77 


0.99 


5,0 


6.80 


102 


6,0 


0.98 


1.00 


7,0 


0.99 


1.01 


8 0 


0.98 


0.98 


9 0 


1.01 


0.99 


-1 

10.0 


1.01 


0.99 


11.0 


0.91 


0.86 


12,0 






9,0 and after 2 

hours at 


1.00 


100 


Table 3; Temperature activity prcffiie 




Tempei^ture 
fC) 




Protease derived from Nocar~ 
diopsts prasina OSM 15648 


1 Protase derived from /Vo- 
1 cardhpsis sp. HRRL 18262 


15 


0.04 


1 0.02 


25 


0.06 


i 0,02 


37 


0.15 


1 0,07 


60 


0,34 


0.20 


60 


0.65 


0,51 


70 


1.00 


1.00 


80 


0.42 


] 0.39 



90 



0 Qther diaracteristiics 

The protease was found to be inhibited by Phenyl Methyl Sulfonyl Fluoride. Its 
relative molecular weight as determined by SDS-PAGE was Mr « 21kDa, and the N-termlnal 
sequence; ADMGGLAYTMG, 

10 EXAMPLE 3: Performance of the Nocaixfiopsis prasma DSM 15648 protege in a 
monogastrtc in v/tro digestion mode! 

The perfom^ance of a purified preparation of the mature part of the protease having 
SEQ ID NO; 2 (prepared as described in Examples 1 and 2) was tested in an in vitm model 
simulating the digestion In monogastric animais. In particular, the protease was tested for its 
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ablltty to improve solubilisation and digestion of malze/-SBM (maizeZ-soybean mea!) proteins, 
The in vitro system consisted of 20 flasks in which maize/-SBy substrate was ini- 
tially incubated with HCi/pepsin - simuSating gastric digestion - and subsequently with pan- 
creatin - simuSating intestinal digestion. 1 5 of the flasi<s were dosed with the protease at the 
5 start of the gastric phase whereas the remaining flasks served as bianks. At the end of the 
intestinal incubation pliase samples of in vitro digesta were removed and analysed for solubi- 
lised and digested protein. 



Outline of Iff y/fro digestion procedure 



Components added 


pH 


Temperature 


Time 
course 


Simulated diges* 
tlort phase 


10 g maize/-S8i\<1 substrate 
(6:4), 41 mi HCI {O.IOSM) 


3.0 


40'*C 


twQmin 


Mixing 


5 mi HCi (O.iOSiV!)/ pepsin 
(3000 U/g substrate), 1 ml 
protease of the invention 


3.0 


40'C 


t=30 min 


Gastric digestion 


16 ml H^O 


3.0 


40"C 


t= 1.0 hour 


Gastric digestion 


7 m! NaOH {0,39M) 


6.8 


40'C 


t=1.5hours 


intestinal diges- 
tion 


5 m! NaHC03{irvl)/pan - 
creatin (8 mg/g diet) 


6.8 


40*C 


t=2.0 hours 


intestinal digesti- 
on 


Terminate incubation 


7.0 


40'C 


t«6.0 hours 





10 
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Conditions 

Substrate: 4 g SBM, 6 g maize (premixed) 

pi-i: 3.0 stomad) step/ 6.8-7,0 intestinal step 

HC!: 0.105 M for 1 ,5 hours {i.e. 30 min HCI-substrate premixing) 

pepsin: 3000 U /g diet for 1 hour 

pancreatin : 8 mg/g diet for 4 hours 

temperature: 40*0, 

Replicates: 5 



20 Solutions 



25 



0.39 M NaOH 
0.105 M HCi 

0.105 M HC! containing 6000 U pepsin per 5 mi 
1 y NaHCOg containing 16 mg pancreatin per mi 
125 mM t^aAc-buffer, pH 6,0 



Enzyme protein determinations 

The amount of protease enzyme protein (in what follows, Enzyme Protein is abbrevi- 
ated EP) is calculated on the basis of the Aaso vaSues and the amino acid sequences (amino 
30 acid compositions) using the principies outiined in S.C.Giit & P.H, von Hippei, Analytical Bio- 
chemistry 182, 319-326. (1989). 



Ex perimentai proced ur e foL/n w/'q model 

The experimental procedure was according to ttie above outline. pH was measured at 
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time 1, 2.5, and 5.5 hours. Incubations were terminated after 6 hours and sampies of 30 ml 
were removed and placed on ice before centrifugation (10000 x g, 10 mm, 4''C). Supsr- 
natanis were removed and stored at -20X. 



5 Analysis 

All samples were anaiysed for % Degree of protein with ttie CPA method as we!! as 
content of soiubiiised and digested protein using gei filtration. 

DH determination by the OPA-niethQcj 

10 T!ie Degree of Hydrolysis (DH) of protein in different samples was detemiined using 

an semi-automated microtiter plate based colorimetric metfiod (Nielsen.P.M.; Petersen.D.; 
Dambmann.C. improved method for determining food protein degree of hydrolysis. J. Food 
Sot. 2001, 66, 642-646). The OP A reagent was prepared as follows: 7.620 g di-Na tetrabo- 
rate decahydrate and 200 mg sodiumdodecyl sulphate (SDS) were dissolved in 150 m! de- 

15 ionized water. The reagents were completely dissolved before continuing. 160 mg o-phthal- 
dialdehyde 97% (OPA) was dissolved in 4 ml ethanoi. The OPA solution was transferred 
quantitatively to the above-mentioned solution by rinsing with deionized water. 176 mg dithio- 
threitoi 99% (DTT) was added to tiie solution that sm&s made up to 200 ml with deionized wa- 
ter. A serine standard {0.9516 meqv/i) was prepared by soiubillsing 50 mg serine {Merci<, 

20 Gemiany) In 500 ml detonized water 

The sample solution was prepared by diluting each sample to an absorbance (280 
nm) of about 0.6. Generally, supematants were diluted (100 «) using an automated Tecan 
dilution station (MSnnedorf, Swltzeriand). All other spectrophotometer readings were per- 
formed at 340 nm using deionized water as the control. 25 pi of sample, standard and blind 

25 was dispensed into a microtiter plate. The micro-titer plate was inserted into an \BUB MP 
reader (Labsystems, Finland) and 200}jl of OPA reagent was automatjcaliy dispensed. Plates 
were shaken (2 mtn; 700 rpm) before measuring absorbance. Finally, the DH was calculated. 
Eightfold determination of all samples was carried out. 

30 Estimation of soiubillsed and digested protein 

Jh& content of soiubillsed protein in supematants from in vitro digested samples was 
estimated by quantifying mide protein (CP) using gei filtration HPLC. Supematants were 
thawed, filtered through 0.45 pm polycarbonate filters and diluted (1:50. v/v) with H2O, Di- 
luted samples were chromatographed by HPLC using a Superdex Peptide PE (7.5 x 300 

35 mm) gel filtration column (Amersham Biosciences, Sweden). The eluent used for isocratic 
elution was 50 mU sodium phosphate buffer (pH 7.0) containing 150 mlVl NaCl. The total 
volume of elueni per run was 26 ml and the flow rate was 0.4 ml/min. Elution profiles were 
recorded at 214 nm and the total area under the profiles was determined by integration. To 
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estimaie protein content from Integrated areas, a calibration cuw© (R^-0.9093) was made 
from a dilution series of an fn vitro digested reference maize/-SBM sample with known tota! 
protein content. The protein determination in this reference sample was earned out using tiie 
KJeidahf method (detemfiination of % nitrogen; A.OAC. (1984) Official Methods of Analysis 
5 14tii ed., Washington DC). 

The content of digested prcstein was estimated by integrating the chromatogram area 
corresponding to peptides and amino acids having a molecuiar mass of 1500 Daiton or be- 
low {Savoie,L; Gauthter.S.F. Dialysis Celi For The in-vitro Measurement Of Protein Digesti- 
bility. J, Food Sci. 1986, 51. 494-498; Babinszky.L.; Van,D.fvU,M,; Boer,H.; Den.H.LA, An 
10 In-vitro tvlethod for Prediction of The Digestible Crude Protein Content in Pig Feeds. J. Sci. 
Food Agr. 1900, SO, 173-178; Bolsen.S.; £ggum,B-0. Critical Evaluation of In-vitro Methods 
for Estimating Digestibility in Simpie-Stomach Animals. Nutrition Research Reviews 1891, 4, 
141-162). To determine the 1500 Daiton dividing line, the gel filtration column was calibrated 
using cytochnome C (Boehringer, Gemiany), aprotinin, gastrin I, and substance P (Sigma Ai- 
ts drich, USA), as molecular mass standards. 

Remjits 

The results shown in Tables 4 and 5 below indicate that the protease increased the 
Degree of Hydrolysis {DH), as well as soluble and digestible protein significantly, 

20 

Table 4: Degree of Hydroiysis fPH) 







Of total protein 


Relative to blank 


Enzyme [m^EP/kg] 


n 


%DH 


SD 






%cv 


Blank 


5 


25.1 i 


0.6 


100.0 


a 


2.4 


Tiie Protease of the In- 
vention |1 00] 


5 


1 

27.4 1 


0.9 


109.3 


b 


3,1 



DtiTerent letters within the same coSumn indicate signiJicant differences (1-way ANOVA, Takey-Kramer test, 
P<0.05). SD ~ Standard Deviation. %CV « Cosffictent of Vsriance « {SD/mean vatuo) x 100% 
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and diaested crude protein 








Oi 


F total protein 




Relative to blank 


Enzyme (mg 
EP/kg| 


t1 


%dig. CP 


SD 




%sol.CP 


SD 


%dig.CP 


cv% 




%so!,CP 


CV% 


Blank 




fi1 0 




0,6 




Q1 0 




0,6 






0.8 




100 0 


s 


0.6 


The Protease 
of ttie inven- 
tion 
[251 


5 


63.4 




0.2 




93,8 




0,7 


104.0 




0,3 


b 


103,1 


b 


0.7 


The Protease 
of tiie inven- 
tion 
[501 


5 


63,1 




0.6 




93,6 




0.6 


103.6 




0.9 


b 


102.8 


b 


0.7 


The Protease 
of the inven- 
tion 
[100] 


5 


65.4 




0.3 




95.3 




0.8 


107.2 




0.4 


C 


104.7 


0 


0.8 



Different iettei-s within the same coiumr, indicate significant differences (SAS MP, ail pair wse, Tukey). {1-way 
ANOVA, Tukey-Kramer test, P<0.05). SD = SJarwlafd DeviaSon, %CV ™ Coefficlertt of Variance ~ {SD/mmn 
value) X 100% 
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1 . An isolated polypeptide having protease activity, selected from the group consisting 
of: 

5 (a) a polypeptide having an amino acid sequence which has a degree of identity to amino 
acids 1 to 188 of SEQ !D HO: 2 of at least 98.0%; 

(l^) a polypeptide wliicls is encoded by a nucleic acid sequence wliich hybridizes under 

medium-high stringency conditions with 

(i) nucleotides 496-1059 of SEQ ID NO: 1 
10 (ii) a subsequence of (i) of at least 100 nucleotides, and/or 

(tii) a complementary strand of {i), (it), or (iii); 

(c) a variant of the polypeptide having an amino add sequence of amino acids 1 to 188 

of SEQ !D HO: 2 comprising a substitution, defetlon, extension, and/or insertion of one or 

more amino acids; 
15 (d) an aileijc variant of (a), (b) or (c); and 

(e) a fragment of (a), (b), {c), or (d) that has protease activity. 

2. An isolated nucleic acid sequence comprising a nucietc acid sequen(^ which 
(a) encodes the polypeptide of claim 1 ; 

20 {b) encodes a polypeptide having protease activity, and which hybridizes under rnedtum- 
high stringency conditions with 

(i) nucleotides 496-1 069 of SEQ iD NO: 1 , 

(ii) a subsequence of (i) of at least 100 nucleotides, and/or 
(if)} a complementary strand of (i), (ii), or (iii); and/or 

25 (c) encodes a polypeptide having protease acti>^ty and vvhidi has a degree of identity to 
nucleotides 496-1059 SEQ ID NO; 1 of at least 96.9%. 

3. A nucleic acid consfruct comprising the nudeic add sequence of claim 2 operably 
linked to one or more control sequences that direct the production of the polypeptide in a 

30 suitable expression host. 

4. A recombinant expression vector compnsing the nucieic acid construct of claim 3, 

5. A recombinant host cell comprising the nucieic acid construct of claim 3 or the vector 
35 of ciaim 4. 
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6, A method for producing a polypeptide of claim 1, the method comprising: (a) 
cultivating a recombinant host ceil of claim 5 to produce a supernatant comprising the 
polypeptide; and (b) recovering the polypeptide, 

5 7, A transgenic plant, or plant part, capabie of e>q3ressing the polypeptide of claim 1 > 

8. A transgenic, non-human animal, or products, or elements thereof, being capable of 
expressing the polypeptide of claim 1 . 

10 9, Use of at least one polypeptide as defined in claim 1 (i) in animal feed; (ii) in the 
preparation of a composition for use in animal feed; (iil) for improving the nutritional value of 
an animal feed; (tv) for increasing digestibie and/or soluble protein in animal diets; (v) for in- 
creasing the degree of hydrolysis of proteins in animal diets; and/or (vi) for the treatment of 
proteins. 

16 

10. An animal feed additive comprising at least one polypeptide as defined in claim 1; 
and 

(a) at least one fat-soluble vitamin, and/or 

(b) at least one water-soluble vitamin, and/or 
20 (c) at least one trace mineral, 

11 An animal feed composition having a crude protein content of 50 to 800 g/kg and 
comprising at least one polypeptide as defined In claim 1, or at ieast one feed additive of 
claim 10, 

25 

12. A composition comprising at least one polypeptide as defined in claim 1 , together with 
at least one other enzyme selected from amongst phytase {EC 3.1 .3,8 or 3.1 .3.26); xylanase 
(EC 3.2,1,8); galactanase (EC 3.2.1,89); alpha-galactosidase {EC 3,2.1.22); protease (EC 
3.4.-.-), phosphollpase A1 (EC 3.1.1.32); phospholipase A2 {EC 3,1.1.4); lysophospholtpase 

30 (EC 3.1.1.5); phospholipase C {3.1.4.3); phosphollpase D (EC 3.1.4.4); amylase, and/or 
faeta-glucanase (EC 3.2.1 .4 or EC 3.2.16). 

13. Use of at least one polypeptide as defined in daim 1 in detergents. 
35 14. Nocardiopsis pmsina QBM iBMS. 
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10477, 204-kfO,ST25 
SEQUENCE LISTING 

<110> Novozyines A/S 

<120> Proteases 
<130> 10477. 204-WO 
<160> 4 

<170> patentln version 3.2 

<210> 1 

<211> 1062 

<212> ONA 

<213?^ Nocardiopsis prasina DSM 15648 

<220> 

<221> CDS 

<222> a)..ao59) 

<220> 

<2 2 1> Riai_pepti de 
<222> C4965..aOS9) 

<400> 1 

gcc acc gga ccg etc ccc cag tea ccc acc ccg gag gcc gac gcc 45 
A{a Thr Giy Pro Leu Pro Gin Ser Pro Thr pro Glu Ala Asp Ala 
-165 -160 -155 

gtc tec atg cag gag gcg etc cag cgc gac etc gqc ctg acc ccq 90 
Val Ser Met Gin <3lu Ala Leu Gin Arq Asp Leu aly Leu Thr Fro 
-150 -145 -140 

ctt gag gcc gat gaa ctg ctg qcc cjcc cag gac acc gcc ttc gac '135 
Leu Glu Ala Asp Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu 
-135 -130 -125 



gtc gac gag gcc gcg gcc gcg gcc gcc ggg gac gcc tac ggc ggc 
val Asp Glu Ala Ala Ala Ala Ala Ala Giy Asp Ala xyr Giy Giy 
-120 -115 -110 



ggt gac acc gtc gtc ctg gag gtc ctg gag gqt tec gga gcc gac gtg 
Giy Asp Thr Val Val teu Glu Val Leu Glu Giy ser Giy Ala Asp Val 
-40 -35 -30 



180 



tec gtc ttc gac acc gag acc ctg qaa ctg acc gtc ctg gtc acc gac 228 
Ser Val Phe Asp Thr Glu Thr Leu Glu Leu Thr Val Leu Val Thr Asp 
-105 -100 -95 -90 

gcc gcc teg gtc gag get gta gag gcc acc ggc gcg ggt acc gaa etc 276 
Ala Ala ser val Glu Ala VaT Glu Ala Thr Giy Ala Giy Thr Glu Leu 
-85 -SO -75 

gtc tec tac ggc ate gag ggc etc gac gag ate ate cag gat etc aac 324 
val ser Tyr Giy lie Glu Giy Leu Asp Glu lie lie Gin Asp Leu Asn 

-70 -65 -60 

QCC gcc gac gcc gtc cee ggc gtg gtc ggc tgg tac ccg gac gtg gcg 372 
Ala Ala Asp Ala val Pro Giy val val Giy Trp Tyr pro Asp Val Ala 
-55 -50 -45 



420 



age ggc ctg etc gcc gac gcc ggc gtg gac gcc teg gcc gtc gag gtg 468 
Ser Giy Leu Leu Ala Asp Ala Giy val Asp Ala ser Ala Val Glu val 
-25 -20 -15 -10 

acc age agt gcg cag ccc gag etc tac gcc gac ate ate ggc ggt ctg 516 
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10477 . 204-vVO. ST25 
Thr Ser ser Ala Gin Pro Glu Leu Tyr Ala Asp lie lie Gly Gly Leu 

gcc tac acc atg ggc ggc cgc tgt teg gtc gga ttc gcg gcc acc aac 564 
Ala Tyr Thr Met Gly Giy Arg cys Ser val Gly Phe Ala Ala Thr Asn 
10 15 20 

gcc gcc ggt cag ccc gga ttc gtc acc gcc ggt cac tgt ggc cgc gtg 612 
Ala Ala Gly Gin pro Gly Phe val Thr Ala Gly His Cys Gly Arg vat 
25 30 35 

ggc acc cag gtg age ate ggc aac ggc cag ggc gtc ttc gag cag tec 660 
Gly Thr Gin Val Ser He Gly Asn Gly Gin Gly Val Phe Glu Gin Ser 
40 45 50 55 

ate ttc ceg ggc aac gac gcc gcc ttc gtc cgc ggc acg tec aac ttc 708 
He Phe Pro Gly Asn Asp Ala Ala Phe val Arg Gly Thr Ser Asn Phe 
60 65 70 

acg ctg acc aac ctg gtc age cgc tac aac acc ggc ggt tac gcc acc 755 
Thr Leu Thr Asn Leu val ser Arg Tyr Asn Thr Gly Giy Tyr Ala Thr 

75 SO 85 

gtc gcc ggc cac aac cag gcg ccc ate ggc tec tec gtc tgc cgc tec 804 
Val Ala Gly His Asn Gin Ala Pro lie Gly ser ser val cys Arg ser 
90 95 100 

ggc tec acc acc ggc tgg cac tgc ggc acc ate cag gcc cgc ggc cag 852 
Gly ser Thr Thr Gly Trp His Cys Gly Thr He Gin Ala Arg Gly Gin 
105 110 115 

teg gtg age tac ccc gag ggc acc gtc acc aac atg acc egg acc acc 900 
Ser Val ser Tyr Pro Glu Gly Thr val Thr Asn Met Thr Arg Thr Thr 
120 125 130 135 

gtg tgc gcc gag ccc ggc gac tec ggc ggc tec tac ate tec gqc aac 
Val Cys Ala Glu Pro Gly Asp ser Gly Gly Ser Tyr i1e ser Gly Asn 
■ 140 145 150 

cag gcc cag ggc gtc acc tec ggc ggc tec gac aac tgc cgc acc ggc 996 
Gin Ala Gin Gly val Thr Ser Giy Gly ser Gly Asn Cys Arg Thr Gly 
155 160 165 



ggg acc acc ttc tac cag gag gtc acc ccc atg gtg aac tec tgg ggc 
Gly Thr Thr Phe Tyr Gin Glu val Thr Pro Met val Asn Ser Trp Gly 
170 175 180 



1044 



gtc cgt etc egg acc taa 1062 
val Arg Leu Arg Thr 
185 

<210> 2 

<2U> 353 

<212> PRT 

<213> Nocardiopsis praslna OSM 15648 

<400> 2 

Ala Thr Gly Pro Leu Pro Gin Ser Pro Thr Pro Glu Ala Asp Ala 
-165 -160 -155 

val ser jMet Gin Glu Ala Leu Gin Arg Asp Leu Gly Leu Thr Pro 
-150 -145 -140 

Leu Glu Ala Asp Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu 
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10477.204~WO.ST25 
-135 -130 -125 

Val Asp <3lu Ala Ala Ala Ala Ala Ala Gly Asp Ala Tyr 6ly Gly 
-120 -115 -110 

ser Val Phe Asp Thr 61 u Thr Leu slu Leu Thr val Leu val Thr Asp 
-105 -100 -95 -90 

Ala Ala ser val Glu Ala val Glu Ala Thr Gly Ala Gly Thr Glu teu 
-8S -80 -75 

Val Ser Tyr Gly He Glu Gly Leu Asp <fio lie He Gin Asp Leu Asn 
-70 -65 -60 

Ala Ala Asp Ala Val Pro Gly Val Val Cly Trp Tyr Pro Asp Val Ala 
-55 -50 -45 

Gly Asp Thr Val Val Leu Glu Val Leu Glu Gly Ser Gly Ala Asp Val 
-40 -35 -30 

Ser Gly Leu Leu Ala Asp Ala Gly Val Asp Ala Ser Ala Val Glu val 
-25 -20 -15 -10 

Thr Ser Ser Ala Gin pro Glu Leu Tyr Ala Asp lie lie Gly Gly Leu 
-5 -11 5 

Ala Tyr Thr Met Gly Gly Arg Cys Ser val Gly Phe Ala Ala Thr Asn 
10 15 20 

Ala Ala Gly Gin pro Gly Phe val Thr Ala Gly His cys Gly Arg val 
25 30 35 

Gly Thr Gin Val Ser He Gly Asn Gly Gin Gly Val Phe Glu Gin Ser 
40 45 50 55 

lie Phe Pro Gly Asn Asp Ala Ala Phe Val Arg Gly Thr ser Asn Phe 
60 65 70 

Thr Leu Thr Asn Leu Val Ser Arg Tyr Asn Thr Gly Gly Tyr Ala Thr 
75 80 85 

Val Ala Gly His Asn Gin Ala Pro lie Gly Ser ser Val Cys Arg Ser 
90 95 100 

Qly ser Thr Thr Gly Trp His Cys Gly Thr lie Gin Ala Arg Gly Gin 

X05 110 115 

Ser Val Ser Tyr Pro €lu Gly Thr val Thr Asn net Thr Arg Thr Thr 
120 125 130 135 

val cys Ala Glu pro Gly Asp ser Gly Gly ser Tyr lie ser Gly Asn 
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140 



10477.204-WO.ST25 
145 



150 



Gin Ala Gin Gly Val Thr ser Gly Gly Ser Gly Asn Cys Arg Thr Gly 
155 160 165 



Gly Thr Thr Phe jyr gIr gIu Val Thr Pro Met val Asn ser rrp Gly 
170 175 180 



val Arg Leu Arg Thr 
1S5 



<210> 3 

<211> 43 

<212> DNA 

<213> syntheti c 



<220> 

<221> misc_feature 
<223> Primer 

<400> 3 

gttcatcgat cgcatcggct gccaccggac cgctccccca gtc 43 



<210> 4 

<211> 38 

<;212> DNA 

<213> Synthetic 



<220> 

<221> misc„feature 
<223> Primer 



<400> 4 

geggatccta ttaggtccgg agacggacgc cccaggag 
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